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Abstract. We propose a novel, type-elimination-based method for standard reasoning in the de- 
scription logic S9-(IQb s extended by DL-safe rules. To this end, we first establish a knowledge 
compilation method converting the terminological part of an 3{£.CIb knowledge base into an or- 
dered binary decision diagram (OBDD) that represents a canonical model. This OBDD can in turn 
be transformed into disjunctive Datalog and merged with the assertional part of the knowledge base 
in order to perform combined reasoning. In order to leverage our technique for full S9-(IQb s , we 
provide a stepwise reduction from S9-(IQb s to 3\.£.CIb that preserves satisfiability and entailment of 
positive and negative ground facts. The proposed technique is shown to be worst-case optimal w.r.t. 
combined and data complexity. 



1. Introduction 

Description logics (DLs, see Baader et all 120071 ) have become a major paradigm in Knowledge 
Representation and Reasoning. This can in part be attributed to the fact that the DLs have been 
found suitable to be the foundation for ontolo gy modeling and reasoning for the Semantic Web. In 
particular, the Web Ontology Language OWL (|W3C OWL Working Groupl . l2009l ). a recommended 
standard by the World Wide Web Consorti um (W3Cj3 for ontology modeling, is essentially a de- 
scription logic (see, e.g.. lHitzler et al, , 2009, for an introduction to OWL and an in-depth description 
of the correspondences). As such, DLs are currently gaining significant momentum in application 
areas, and are being picked up as knowledge representation paradigm by both industry and applied 
research. 
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The DL known as S'HIQ is among the most prominent DL fragments that do not feature nominalsEI 
and it covers most of the OWL language. Various OWL reasoners implement efficient reasoning 
support for S'HIQ by means of tableau methods, e.g., Pellet0FaCT++Oor RacerProJfl 
However, even the most efficient implementations of reasoning algorithms to date do not scale up 
to very data-intensive application scenarios. This motivates the search for alternative reasoning ap- 
proaches that build upon different methods in order to address case s where tableau algorithm s turn 
out to have certain wea knesses. Successfu l examples are KAON2 (Mo tik and Sattlei . 2006) based 



on resolution, HermiT (Moti k et all l2009h base d on hyper-tabl e aux, a s well as th e consequence 



based systems CB (IKazakovt , 120091) . ConDOR (Simancf k et all 1201 lh . and ELK (Kazakov et al 



201 1). Moreover, especially for l ightweight DLs, approac hes based on rewriting queries (ICalvanese et al 



2007al) or both queries and data (IKontchakov et al 



2010) have been proposed. 



In th i s pa per, we propose the use of a variant of type elimination, a notion first introduced by 
Pratt (1972), as a reasoning paradigm for DLs. To implement the necessary computations on large 



type sets in a compressed way, we suggest the use of ordered binary decision diagrams (OBDDs). 
OBDDs have been applied successfully in the domain of large-sca le model checkin g and verifica- 



tion, but have hitherto seen only little investigation in DLs, e.g., by iPan et al 
Most of the description logics considered in this article exhibit restricted Boolean role expressions 
as a non-standard modeling feature, which is indicated by a b or (if further restricted) b s in the 
name of the DL. In particular, we propose a novel method for reasoning in S'HIQbs knowledge 
bases featuring terminological and assertional knowledge including (in)equality statements as well 
as DL-safe rules. 

Our work starts by considering terminological reasoning in the DL S\£,CIb, which is less expressive 
than S'HIQbs. We introduce a method that compiles an JiUCIb terminology into an OBDD 
representation. Thereafter, we show that the output of this algorithm can be used for generating 
a disjunctive Datalog program that can in turn be combined with ABox data to obtain a correct 
reasoning procedure. Finally, the results for 3K£,CIb are lifted to full S'HIQbs by providing an 
appropriate translation from the latter to the former. 



This a rticle combines and consoli dates our previous work about pure TBox reasoning (Rudolp h et al. 
2008c), its extension to ABoxes (Rudolph et al., 2008b) and some notes on reasoning in DLs with 
Boolean role expressions (Rudolph et all 12008 al ) by 

• providing a collection of techniques for eliminating S'HIQbs modeling features that impede the 
use of our type elimination approach, 

• laying out the model-theoretic foundations for type-elimination-based reasoning for very expres- 
sive description logics without nominals, using the domino metaphor for 2-types, 

• elaborating the possibility of using OBDDs for making type elimination computationally feasible, 

• providing a canonical translation of OBDDs into disjunctive Datalog to enable reasoning with 
assertional information, and 

• making the full proofs accessible in a published version. 

Moreover, we extend our work by adding some missing aspects and completing the theoretical 
investigations by 



Nominals, i.e., concepts that denote a set with exactly one element, usually cause a reasoning efficiency problem 
when added to S'HIQ. This is evident from the performance of existing systems, and finds its theoretical justification in 
the fact that they increase worst-case complexity from ExpTime-completeness to NExpTime-completeness. 
http://clarkparsia.com/pellet/ 
4 http://owl. man.ac.uk/factplusplus/ 
5 http://www.racer-systems.com/ 
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extending the procedures for reducing SHIQbs to J[£jCIb to ABoxes and DL-safe rules, 
establishing worst-case optimality of our algorithms, 
extending the supp orted language: while our previous work only covered ter minological rea 
sonmg m SI-tlQ dRudolph et all 2008c) and combined reasoning in J{£jCIb dRudolph et al 



2008b), we now support reasoning in S'HIQbs knowledge bases featuring terminological and 



assertional knowledge, including (in)equality statements and DL-safe rules. 
The structure of this article is as follows. Section [2] recalls relevant preliminaries. Section [3] dis- 
cusses the computation of sets of dominoes that represent models of 3{£jCIb knowledge bases. 
Section 0] casts this computation into a manipulation of OBDDs as underlying data structures. Sec- 
tion [5] discusses how the resulting OBDD presentation can be transformed to disjunctive Datalog 
and establishes the correctness of the approach. Section[6]provides a transformation from S'HIQbs 
to 3i£jCIb, thereby extending the applicability of the proposed method to S'HIQbs knowledge 
bases. Section[7]discusses related work and Section[8]concludes. 



2. The Description Logics S'HIQbs and 3{£jCIb 

We first recall some basic definitions of DLs and introduce our notati on. A more gen tle first in- 
troduction to DLs, together with pointers to further reading, is given in Rudolph) ( 2011 ). Here, we 



define a rather expressive description lo gic S'HIQbs that extends S'HIQ with restricted Boolean 
role expressions (see, e. g.. iTobiesl 120011 ). 

Definition 2.1. A SHIQb s knowledge base is based on three disjoint sets of concept names He, 
role names N^, and individual names Nj. The set of atomic roles R is defined by R := N# U {R~ \ 
R e Nr}. In addition, we let Inv(/?) := R~ and Inv(7?~) := R, and we extend this notation also to 
sets of atomic roles. In the following, we use the symbols R and S to denote atomic roles, if not 
specified otherwise. 

The set of Boolean role expressions B is defined as 

B ::= R | -.B | B n B | B U B. 

We use h to denote entailment between sets of atomic roles and role expressions. Formally, given a 
set 51 of atomic roles, we inductively define: 

• for atomic roles R, 51 h R if R € 51, and 51 ¥■ R otherwise, 

• 51 h -.£/ if 51 y- U, and 51 y- -.£/ otherwise, 

• 51 h U n V if 51 h U and 51 h V, and 31 ¥■ U n V otherwise, 

• 51^UUV if 3L\-U or 3L\-V, and5lyUuV otherwise. 

A Boolean role expression U is restricted if Y- U. The set of all restricted role expressions is 
denoted by T, and the symbols U and V will be used throughout this paper to denote restricted role 
expressions. A SHIQb s RBox is a set of axioms of the form U E V (role inclusion axiom) or 
Tra(/?) (transitivity axiom). The set of non-simple roles (for a given RBox) is defined as the smallest 
subset of R satisfying: 

• If there is an axiom Tra(R), then R is non-simple. 

• If there is an axiom R E S with R non-simple, then 5 is non-simple. 

• If R is non-simple, then Inv(R) is non-simple. 

An atomic role is simple if it is not non-simple. In S'HIQbs, every non-atomic Boolean role 
expression must contain only simple roles. 

Based on a S'HIQbs RBox, the set of concept expressions C is defined as 

C ::= N c | T | _L | -.C | C n C | C U C | VT.C | 3T.C | <«RC | >{n + 1)R.C, 
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where n > denotes a natural number, and the role S in expressions ^nS.C and ^(n + l)S.C 
is required to be simple. Common names for the various forms of concept expressions are given 
in Table Q] (lower part). Throughout this paper, the symbols C, D will be used to denote concept 
expressions. A Sl-lIQbs TBox (or terminology) is a set of general concept inclusion axioms (GCIs) 
of the form CEO. 

Besides the terminological components, DL knowledge bases typically include assertional knowl- 
edge as well. In order to increase expressivity and to allow for a uniform presentation of our 
approach we generalize this by allowing knowledge bases to contain so-called DL-safe rules as 



introduced by lMotik et all (2005). 



Definition 2.2. Let V be a countable set of first-order variables. A term is an element of V U N/. 
Given terms t and u, a concept atom/role atom/equality atom is a formula of the form C(t)/R(t, u)/t « 
u with C e Nc and R e Nr. A DL-safe rule for S^HIQbs is a formula B — > H, where B and H are 
possibly empty conjunctions of (role, concept, and equality) atoms. To simplify notation, we will 
often use finite sets S of atoms for representing the conjunction A, 5 . 

A set CP of DL-safe rules is called a rule base. An extended S'HIQbs knowledge base KB is a triple 
(T, % CP>, where T is a S9iIQb s TBox, CR is a S<HIQb s RBox, and CP is a rule base. 

We only consider extended knowledge bases in this work, so we will often just speak of knowledge 
bases. In the literature, a DL ABox is usually allowed to contain assertions of the form A(a), R(a, b), 
or a « b, where a, b e N/, A e Nc, and R € N«. We assume that all roles and concepts occurring in 
the ABox are atomic^ These assertions can directly be expressed as DL-safe rules that have empty 
(vacuously true) bodies and a single head atom. Conversely, the negation of these assertions can be 
expressed by rules that have the assertion as body atom while having an empty (vacuously false) 
head. Knowing this, we will not specifically consider assertions or negated assertions in the proofs 
of this paper. For convenience we will, however, sometimes use the above notations instead of their 
rule counterparts when referring to (positive or negated) ground facts. 

As mentioned above, we will mostly consider fragments of S9{IQb s . In particular, an (extended) 
{RJjCIb knowledge base is an (extended) S9iIQb s knowledge base that contains no RBox axioms 
and no number restrictions (i.e., concept expressions <,nR.C or ^nR.C). Consequently, an extended 
{RJjCIb knowledge base only consists of a pair (7, CP), w here 7 is a TBox and CP is a rule base. The 
related DL JKLCQIb has been studied by lTobiesI d200ll) . 



The semantics of S1-lIQb s and its sublogics is defined in the usual, model-theoretic way. An 
interpretation I consists of a set A J called domain (the elements of it being called individuals) 
together with a function - 1 mapping individual names to elements of A J , concept names to subsets 
of A 1 , and role names to subsets of A J x A J . 

The function J is extended to role and concept expressions as shown in Table Q] An interpretation 
I satisfies an axiom <p if we find that I \= tp, where 

• I \= U E V if U 1 c V 1 , 

• I \= Jra(R) if R 1 is a transitive relation, 

• I \= C c D if C 1 c D 1 , 

I satisfies a knowledge base KB, denoted I \= KB, if it satisfies all axioms of KB. 

It remains to define the semantics of DL-safe rules. A (DL-safe) variable assignment Z for an 

interpretation J is a mapping from the set of variables V to {a 1 \ a e N/}. Given a term (eN;UV, 



^This common assumption is made without loss of generality in terms of knowledge base expressivity. It is essential 
for defining the ABox-specific complexity measure of data complexity, although it might be questionable in cases where 
ABox statements with complex concept expressions belong to the part of the knowledge base that is frequently changing. 
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Name 


S vntax 


Semantics 

kj V_. 111 CI. 1 1 I l\- L 7 


invprsp rolp 

1 1 1 V 1 I V 7 1 


R 


Ux v) e A J x A 1 1 <v x> e /? J 1 


role negation 






IU1C CUI1J UI1L-L1UI1 


TJ n V 

U 1 1 V 


C 1 1 V 




II I I V 


f/ J U V J 


top 


T 




bottom 


± 





negation 


-iC 




conjunction 


C n D 


C J n D J 


disjunction 


CUD 


C U L) 


universal restriction 


MU.C 


(i e A 1 | (x,j> e £/ J implies y e C 1 } 


existential restriction 


3U.C 


[x e A 1 | (x,y) e U 1 , y e C 1 for some y e A 1 } 


qualified 


<nS.C 


{x e A J | My e A J |(x, y> e S 1 , y e C 1 } < n\ 


number restriction 


>nS.C 


{x e A J | #{y e A J |<x, y) e S 1 , y e C 1 } > n} 



Table 1 : Semantics of constructors in S'HIQb s for an interpretation I with domain A 1 



we set t T ' Z := Z(7) if t e V, and ? J ' Z := ? J otherwise. Given a concept atom C(t) / role atom 
R(t, u) I equality atom (*«,we write I,Z \= C(t) / I,Z |= P(f, m) / J,Z |= ? « w if ? J - Z e C J / 
<? J,Z , w JZ ) eR 1 / t J - z - u T ' Z , and we say that J and Z satisfy the atom in this case. 
An interpretation I satisfies a rule B — > H if , for all variable assignments Z for I, either J and Z 
satisfy all atoms in H, or J and Z fail to satisfy some atom in B. In this case, we write I \= B — » H 
and say that J is a model for B — > H. An interpretation satisfies a rule base IP (i.e., it is a model for 
it) whenever it satisfies all rules in it. An extended knowledge base KB = (7, 01, "?) is satisfiable if it 
has an interpretation I that is a model for T, % and IP, and it is unsatisfiable otherwise. Satisfiability, 
equivalence, and equisatisfiability of (extended) knowledge bases are defined as usual. 
For convenience of notation, we abbreviate TBox axioms of the form T E C by writing just C. 
Statements such as I \= C and C e KB are interpreted accordingly. Note that C E D can thus be 
written as ->C U D. 

We often need to access a particular set of quantified and atomic subformulae of a DL concept 
expression. These specific parts are provided by the function P : C —> 2 C : 

P{D) if C - -.£>, 

P(D) U P(£) ifC = Dn£orC = Du£', 
{C} U P(D) if C = Q£/.D with D e {3, V, >n, <n}, 
{C} otherwise. 

We generalize P to DL knowledge bases KB by defining P(KB) to be the union of the sets P(C) 
for all TBox axioms C in KB, where we express TBox axioms as simple concept expressions as 
explained above. 

Given an extended knowledge base KB, we obtain its negation normal form NNF(KB) by keeping 
all RBox statements and DL-safe rules untouched and converting every TBox concept C into its 
negation normal form NNF(C) in the usual, recursively defined way: 



P(Q := 
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NNF(-iT) 
NNF(-.±) 
NNF(C) 
NNF(-i-iC) 
NNF(CnD) 
NNF(-.(C n D)) 
NNF(CuD) 
NNF(-.(C U D)) 



Cif Ce [A,-iA,T,±) 
NNF(C) 

NNF(C) n NNF(D) 
NNF(-.C) u NNF(-.D) 
NNF(C) u NNF(D) 
NNF(-.C) n NNF(-iD) 



NNF(V t/.C) 
NNF(-.Vf/.C) 
NNF(3t/.C) 
NNF(-,3t/.C) 
NNF««/?.C) 
NNF(-. ^nR.C) 
NNF(>ra/?.C) 
NNF(-i >nR.C) 



W.NNF(C) 
3t/.NNF(-,C) 
3t/.NNF(C) 
Vf/.NNF(-.C) 
<«/?.NNF(C) 
>(n + l)tf.NNF(C) 
>n/?.NNF(C) 
<(n- l)tf.NNF(C) 



It is well known that KB and NNF(KB) are semantically equivalent. 

In places, we will additionally require another well-known normalization step that simplifies the 
structure of KB by flattening it to a knowledge base FLAT(KB). This is achieved by transforming 
KB into negation normal form and exhaustively applying the following transformation rules: 

• Select an outermost occurrence of OU.D in KB, such that O € {3, V, <«, ^n] and D is a non- 
atomic concept. 

• Substitute this occurrence with OU.F where F is a fresh concept name (i.e., one not occurring in 
the knowledge base). 

• If € {3, V, >n}, add ->F U D to the knowledge base. 

• If O = add NNF(-iD) U F to the knowledge base. 

Obviously, this procedure terminates, yielding a flat knowledge base FLAT(KB) all TBox axioms 
of which are n, U-expressions over formulae of the form T, _L, A, ->A, or QUA with A an atomic 
concept name. Flattening is known to be a satisfiability-preserving transformation; we include the 
proof for the sake of self-containedness. 



Proposition 2.3. 

equisatisfiable. 



For every S'HIQbs knowledge base KB, we find that KB and FLAT(KB) are 



Proof. We first prove inductively that every model of FLAT(KB) is a model of KB. Let KB' be an 
intermediate knowledge base and let KB" be the result of applying one single substitution step to 
KB' as described in the above procedure. We now show that any model I of KB" is a model of KB'. 
Let OU.D be the concept expression substituted in KB'. Note that after every substitution step, the 
knowledge base is still in negation normal form. Thus, we see that OU.D occurs outside the scope 
of any negation or quantifier in a KB' axiom E', and the same is the case for OU.F in the respective 
KB" axiom E" obtained after the substitution. Hence, if we show that (OU.F) 1 c (OU.D) 1 , we can 
conclude that E" 1 c E' 1 . From I being a model of KB" and therefore E" 1 = A 1 , we would then 
easily derive that E' 1 = A 1 and hence find that I \= KB', as all other axioms from KB' are trivially 
satisfied due to their presence in KB". 

It remains to show (OU.F) 1 c (OU.D) 1 . To show this, consider some arbitrary 6 e (OU.F) 1 . We 
distinguish various cases: 

• O = >n 

Then there are distinct individuals 5\,...,6 n e A J with (6, <5,) e U 1 and 5, e F 1 for 1 < i < n. 
Since -ifuDe KB", we have I \= ->F U D, and therefore <5,- e D 1 for all the n distinct <5,-. Thus 
8 e On U.F) 1 . 

• D = <n 

Then the number of individuals 5' e A 1 with (8,6') e U 1 and 8' e F 1 is not greater than n. 
Since NNF(^D) U F e KB", we know D 1 c F 1 . Thus, also the number of individuals 8' e A 1 
with (8, 8') e U 1 and 8' e D 1 cannot be greater than n, leading to the conclusion 8 e (<« U.D) 1 . 
Hence, we have «n U.F) 1 c «n U.D) 1 . 
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The arguments for O = 3 and D = V are very similar, since these cases can be treated like >1 U.F 
and <0 U.-iF, respectively. Thus we obtain 8 e (OU.D) 1 in each case as required. 
For the other direction of the claim, note that every model I of KB can be transformed into a model 
J~ of FLAT(KB) by following the flattening process described above: Let KB" result from KB' by 
substituting OU.D by OU.F and adding the respective axiom. Furthermore, let I' be a model of 
KB'. Now we construct the interpretation I" as follows: F 1 " := (OU.D) 1 ' and for all other concept 
and role names N we set N 1 := N 1 . Then I" is a model of KB". □ 



3. Building Models from Domino Sets 

In this section, we introduce the notion of a set of dominoes for a given J{£jCIb TBox. Rules (and 
thus ABox axioms) will be incorporated in Section |5] later on. Intuitively, a domino abstractly rep- 
resents two individuals in an 3{£,CIb interpretation, reflecting their satisfied concepts and mutual 
role relationships. Thereby, dominoes are conceptually very similar to the c oncept of 2-types, a s 



used in investigations on two-variable fragments of first-order logic, e.g., by iGradel et al.l ([1997). 
We will see that suitable sets of such two-element pieces suffice to reconstruct models of 3{£jCIb, 
which also reveals certain model-theoretic properties of this not so common DL. In particular, every 
satisfiable Jl£CIb TBox admits tree-shaped models. This result is rather a by-product of our main 
goal of decomposing models into unstructured sets of local domino components, but it explains 
why our below constructions have some similarity with common approaches of showing tree-model 
properties by unraveling models. 

After introducing the basics of our domino representation, we present an algorithm for deciding 
satisfiability of an Jl£,CIb terminology based on sets of dominoes. 



3.1. From Interpretations to Dominoes. We now introduce the basic notion of a domino set, and 
its relationship to interpretations. Given a DL with concepts C and roles R, a domino over C c C 
is an arbitrary triple (A, ft, 55), where A, 55 c S and c R. In the following, we will always 
assume a fixed language and refer to dominoes over that language only. We now formalize the idea 
of deconstructing an interpretation into a set of dominoes. 

Definition 3.1. Given an interpretation I = (A 1 , - 1 ), and a set C c C of concept expressions, the 
domino projection of I w.r.t. C, denoted by 7Tq(I) is the set that contains, for all 8, 8' e A 1 , the triple 
(A, ft, 15) with 

A = {C ee\8eC T }, ft = [R e R | (8,8') e R 1 }, 55 = {C e 6 I 8' e C J ). 

It is easy to see that domino projections do not faithfully represent the structure of the interpretation 
that they were constructed from. But, as we will see below, domino projections capture enough 
information to reconstruct models of a TBox T, as long as C is chosen to contain at least P(T). For 
this purpose, we introduce the inverse construction of interpretations from arbitrary domino sets. 

Definition 3.2. Given a set D of dominoes, the induced domino interpretation J(B) - (A 1 , - 1 ) is 
defined as follows: 

(1) A 1 consists of all nonempty finite words over D where, for each pair of subsequent letters 
(A, ft, 55) and (A', ft', 55') in a word, we have S - A'. 

(2) For a word cr = (A\, ftL.A2X.A2, X,A^) . . . {Ai-\, ft,-i, Aj) and a concept name A e Nc, we 
define tail(cr) := Aj and set cr e A 1 iff A e tail(cr). 
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(3) For a role name R e N#, we set (en, <T2) e R if 

o- 2 - o-i (A, % "B) with R e ft or en = cr 2 <.A, 3?, 3) with Inv(tf) e ft. 

We can now show that certain domino projections contain enough information to reconstruct models 
of a TBox. 

Proposition 3.3. Consider a set 6 c C of concept expressions, and an interpretation J~, and let 
7C := I(7te(SD) denote the induced domino interpretation of the domino projection of $ w.r.t. C. 
Then, for any tAfJJIb concept expression C £ C with P(C) Q Q, we have that 3 |= C ijfK \= C. 
Especially, for any tR£CIb TBox 7, we have ST \= 7 iff T(Jip(T)(SD) N 7. 

Proof. Consider some C e C as in the claim. We first show the following: given any ^-individual 
8 and 7f -individual cr such that tail(o-) = {D e G | 8 e D J ], we find that cr e C % iff 8 e C J . 
Clearly, the overall claim follows from that statement using the observation that a suitable 8 e A 7 
must exist for all cr e and vice versa. We proceed by induction over the structure of C, noting 
that P(C) c e implies P(D) c 6 for any subconcept D of C. 

The base case C e Nc is immediately satisfied by our assumption on the relationship of 6 and cr, 
since C e P(C). For the induction step, we first note that the case C e {T, _L} is also trivial. For 
C = ->D and C = D n D' as well as C = D U D', the claim follows immediately from the induction 
hypothesis for D and D'. 

Next consider the case C = 3U.D, and assume that 5 e C-^. Hence there is some 5' e A^ such 
that (5, 8') e and 8' e . Then the pair (8, 8') generates a domino (A, % S) and A^ contains 
cr' = cr<yi, 3?, S). (5, 5') e £/ J implies 3i h U (by definition of h and due to the fact that "R contains 
exactly those R e R with (8, 8') e R^), and hence (cr, cr') e Applying the induction hypothesis 
to D, we conclude cr' e D . Now cr £ follows from the construction of 7(". 
For the converse, assume that cr e C^. Hence there is some cr' £ such that (cr,o-') e and 
cr' e D^. By the definition of "K, there are two possible cases: 

• cr' = cr(tail(cr), % tail(cr')) and 01 h c7: Consider the two J'-individuals (8', 8") generating the 
domino (tail(cr), 3?, tail(cr')). From cr' e and the induction hypothesis, we obtain 8" e Z>^. 
Together with (5', 8") e U J this implies 8' e O 7 . Since C = 3U.D e C, we also have C e tail(cr) 
and thus 8 e as claimed. 

• cr - o-'(tail(o-'), % tail(cr)) and Inv(3i) h U : This case is similar to the first case, merely exchang- 
ing the order of (8', 8") and using Inv(IR) instead of 3J. 

Finally, the case C = SU.D is dual to the case C = 3U.D, and we will omit the repeated argument. 
Note, however, that this case does not follow from the semantic equivalence of VU.D and -i3U.-iD, 
since the proof hinges upon the fact that ->D is contained in C which is not given directly. □ 

3.2. Constructing Domino Sets. As shown in the previous section, the domino projection of a 
model of an Jl£,CIb TBox can contain enough information for reconstructing a model. This ob- 
servation can be the basis for designing an algorithm that decides TBox satisfiability. Usually (es- 
pecially in tableau-based algorithms), checking satisfiability amounts to the attempt to construct a 
(representation of a) model. As we have seen, in our case it suffices to try to construct just a model's 
domino projection. If this can be done, we know that there is a model, if not, there is none. 
In what follows, we first describe the iterative construction of such a domino set from a given TBox, 
and then show that it is indeed a decision procedure for TBox satisfiability. 
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Algorithm 1 Computing the canonical domino set D<j of a TBox 7 

Input: 7 an JKLCIb TBox, C - P(FLAT(T)) 
Output: the canonical domino set Dj of T 

1: initialize Do as the set of all dominoes (A, 31, 33) over S satisfying: 

2: for all C e FLAT(T), the GCI HdsA D n riceey! -£> E C is a tautolog>Q (kb) 

3: for all SUA e 6 with A e 3 and 3? h £/, we have 3£/.A £ .A, (ex) 

4: for all Vt/.A e 6 with Vt/.A e .A and 3? h U, we have A e 3. (uni) 

5: i :=0 

6: repeat 

7: i:-i+l 

8: determine D, as the set of all dominoes (A, 31, 3) e D/_i satisfying: 

9: for all SUA e A, there is some (A, 31' , 3') e D ; _i with 31' h t/ and A e 3', (delex) 

10: for all Vt/.A € 6 \ A, there is some (A, 31' , 3') e B,-_i with 31' \- U but A g 3', (deluni) 

ll: <3,Inv(3?),yi> e Dj_i. (sym) 

12: until D ; = D,_i 

13: Do- := Dj 

14: return Do- 



Algorithm Q] describes the construction of the canonical domino set D-j of an J[£jCIb TBox 7. 
Thereby, roughly speaking, condition kb ensures that all the concept parts A and 3 of the con- 
structed domino set abide by the axioms of the considered TBox. The condition ex guarantees that, 
in every domino (A, 31, 3), the concept set A must contain all the existential concepts for which 
31 and 3 serve as witnesses. Conversely, uni makes sure that every universally quantified concept 
recorded in A is appropriately propagated to 3, given a suitable 31. Once enforced, the conditions 
kb, ex, and uni remain valid even if the domino set is reduced further, hence they need to be taken 
care of only at the beginning of the algorithm. In contrast, the conditions delex, deluni, and sym 
may be invalidated again by removing dominoes from the set, thus they need to be applied in an 
iterated way until a fixpoint is reached. Condition delex removes all dominoes with the concept 
set A if A contains an existential concept for which no appropriate "witness" domino (in the above 
sense) can be found in the set. Likewise, deluni removes all dominoes with the concept set A if A 
does not contain a universal concept which should hold given all the remaining dominoes. Finally, 
sym ensures that the domino set contains only dominoes that do have a "symmetric partner", i.e., 
one that is created by swapping A with 3 and inverting all of 31. 

Given that every domino (A, 31,33) satisfies A, 35 c Q and ScR, and that both S and R are linearly 
bounded by the size of 7, Do is exponential in the size of the TBox, hence the iterative deletion 
of dominoes must terminate after at most exponentially many steps. Below we will show that this 
procedure is indeed sound and complete for checking TBox satisfiability. Before that, we will show 
a canonicity result for Dj. 

Lemma 3.4. Consider an JKHCIb terminology 7 and an arbitrary model I of 7. Then the domino 
projection 7rp(FLAT(T))(-^") w contained in Dj. 

Proof. The claim is shown by a simple induction over the construction of Do-. In the following, 
we use {A, 31, 33) to denote an arbitrary domino of ttp(flat(o-))(-0- For the base case, we must show 
that 7Tp(FLAT(T))(-T) £ Do- Let (A, 31, 33) to denote an arbitrary domino of 7Tp(flat(T))(-D which was 

Please note that the formulae in FLAT(T) and in A c 6 are such that this can easily be checked by evaluating the 
Boolean operators in C as if A was a set of true prepositional variables. 
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generated from elements (6,6'). Then (A, 31, 23) satisfies condition kb, since 6 e C 1 for any C € 
FLAT(T). The conditions ex and uni are obviously satisfied. 

For the induction step, assume that fl>(FLAT(T))(-D £ B;, and let (A, 31, 23) again denote an arbitrary 
domino of 7i>(FLAT(T))(-0 which was generated from elements (6, 6'). 

• For delex, note that SUA e A implies 6 e {SUA) 1 . Thus there is an individual 6" such that 
(6,6") e U 1 and 6" e A 1 . Clearly, the domino generated by (6,6") satisfies the conditions of 
delex. 

• For deluni, note that VC/.A £ A implies 6 (SUA) 1 . Thus there is an individual 6" such that 
(6,6") e U 1 and 6" A 1 . Clearly, the domino generated by (6,6") satisfies the conditions of 
deluni. 

• The condition of sym for (A, 31, 23) is clearly satisfied by the domino generated from (6', 6). 
Therefore, the considered domino (A, 31, 23) must be contained in D,-+i as well. □ 

Note that, in contrast to tableau procedures, the presented algorithm starts with a large set of domi- 
noes and successively deletes undesired dominoes. Indeed, we will soon show that the constructed 
domino set is the largest such set from which a domino model can be obtained. The algorithm 
thus may seem to be of little practical use. In Section [4] we therefore refine the above algorithm to 
employ Boolean functions as implicit representations of domino sets, such that the efficient compu- 
tational methods of OBDDs can be exploited. In the meantime, however, domino sets will serve us 
well for showing the required correctness properties. 

An important property of domino interpretations constructed from canonical domino sets is that 
the (semantic) concept membership of an individual can typically be (syntactically) read from the 
domino it has been constructed of. 

Lemma 3.5. Consider an 3\£.CIb TBox T with nonempty canonical domino set Dj, and define 
C := P(FLAT(T)) and I = (A 1 , - 1 ) := J(D T ). Then, for all C € 6 and cr e A 1 , we have that cr e C 1 
iffCe tail(cr). Moreover, I \= FLAT(T). 

Proof. First note that the domain of I is nonempty whenever Dj is. Now if C e Nc is an atomic 
concept, the first claim follows directly from the definition of I. The remaining cases that may 
occur in P(FLAT(T)) are C - SUA and C - SUA. 

First consider the case C = SUA, and assume that cr e C 1 . Thus there is cr' e A 1 with (cr, cr') e U 1 
and cr' e A 1 . The construction of the domino model admits two possible cases: 

• cr' = cr(tail(cr), % tail(cr')) with 31 h U and A e tail(cr'). Since B 7 c D , we find that 
(tail(cr), % tail(cr')) satisfies condition ex, and thus C e tail(cr) as required. 

• cr - cr^tahXcr'), 3?, tail(cr)) with Inv(IR) h U and A e tail(cr'). By condition sym, Bg- also contains 
the domino (tail(cr), Inv(IR), tail(cr')), and we can again invoke ex to conclude C € tail(cr). 

For the other direction, assume SUA e tail(cr). Thus I}j must contain some domino {A, % tail(cr)), 
and by sym also the domino (tail(cr),Inv(IR),.A). By condition delex, the latter implies that Do- 
contains a domino (tai\(cr),J{',A'). According to delex, we find that cr' = cr(tail(cr), 31', A') is an 
J-individual such that (cr, cr') e U 1 and cr' e A 1 . Thus cr 6 (SUA) 1 as claimed. 
For the second case, consider C = ~iU.A and assume that cr e C 1 . Then contains some domino 
(A, 31, tail(cr)), and by sym also the domino (tail(cr), lnv(3l),A). For a contradiction, suppose that 
V£/.A £ tail(cr). By condition deluni, the latter implies that Dj contains a domino (tail(cr),3l' ,A'). 
According to deluni, we find that cr' = cr(tail(cr), 31' , A') is an J-individual such that (cr, cr') e U 1 
and cr' £ D 1 . But then cr £ (SUA) 1 , yielding the required contradiction. 

For the other direction, assume that VU.A e tail(cr). According to the construction of the domino 
model, there are two possible cases for elements cr' with (cr, cr') e U 1 : 
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• cr' - cr(tail(cr), % tail(cr')) with R h U. Since Dj c Do, (tail(cr), % tail(cr')) must satisfy condi- 
tion uni, and thus A € tail(cr'). 

• cr - cr'(tail(cr'), 3?, tail(cr)> with Inv(3l) h U. By condition sym, Dj also contains the domino 
(tail(cr),Inv(3l),tail(o"')), and we can again invoke uni to conclude A e tail(cr'). 

Thus, A € tail(cr') for all [/-successors cr' of cr, and hence cr e Q/U.A) 1 as claimed. 

For the rest of the claim, note that any domino (A, 01, 03) must satisfy condition kb. Using condition 
sym, we conclude that for any cr € A , the axiom | | Dgtaii(<r) D E C is a tautology for all C € 
FLAT(T). As shown above, cr e D 1 for all D e tail(cr), and thus cr e C. Hence every individual of 
I is an instance of each concept of FLAT(T) as required. □ 

The previous lemma shows soundness of our decision algorithm. Conversely, completeness is 
shown by the following lemma. 

Lemma 3.6. Consider an J{£jCIb TBox 7. If T is satisfiable, then its canonical domino set Bj is 
nonempty. 

Proof. This is a straightforward consequence of Lemma 13.41 given a model I of T, the domino 
projection 7rp(FLAT(T))(-T) is nonempty and (by Lemma [3~4l > contained in Bj. Hence Dj is nonempty. 

□ 

We now are ready to establish our main result on checking TBox satisfiability and the complexity 
of the given algorithm: 

Theorem 3.7. An ^R£jCIb TBox T is satisfiable iff its canonical domino set D<j is nonempty. Al- 
gorithm [7] thus describes a decision procedure for satisfiability of J{£J2Ib TBoxes. Moreover, the 
algorithm runs in exponential time and hence is worst-case optimal. 



Proof. The first proposition of the theorem is a direct consequence of Lemma [331 Proposition 
(page [6]), and Lemma [331 

For worst-case optimality, recall that Sf{IQb s is ExpTiME-complete (see Rudolph et all 2008a, 



tor worst-case optimaiity, recall mat c> nl(db s is fc,xp 1 iME-complete ( see IKudolpn et all IzUUoa 
where ExpTiME-hardness already directly follows from the results by SchildT 199 lh . Now, consid 



ering the presented algorithm, we find that the set C = P(FLAT(T)) is linearly bounded by the size 
of 7, whence the size of the set of all dominoes is exponentially bounded by |T|. Applying the 
conditions kb, ex, and uni to obtain Do can be done by subsequently checking every domino, each 
check taking at most 0(|T|) time, hence the overall time for that step is exponentially bounded. Now, 
consider the iterated application of the delex, deluni, and sym conditions. By the same argumenta- 
tion as for kb, ex, and uni, one iteration takes exponential time. On the other hand, each iteration 
step reduces the domino set by at least one domino (otherwise, the termination criterion would be 
satisfied) which gives us a bound of exponentially many steps. Finally note that exponentially many 
exponentially long steps still yield a procedure that is overall exponentially bounded. □ 



4. Sets as Boolean Functions 

The algorithm of the previous section may seem to be of little practical use, since it requires com- 
putations on an exponentially large set of dominoes. The required computation steps, however, can 
also be accomplished with an indirect representation of the possible dominoes based on Boolean 
functions. Indeed, every propositional logic formula represents a set of propositional interpretations 
for which the function evaluates to true. Using a suitable encoding, each propositional interpretation 
can be understood as a domino, and a propositional formula can represent a domino set. 
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As a representation of prepositional formulae well-proven in other contexts, we use binary decision 
diagrams (BDDs). These dat a structures have b een used to represent complex Boolean functions 
in model-checking (see, e.g., iBurch et al. [ [l990h . A particular optimization of these structures are 
ordered BDDs (OBDDs) that use a dynamic precedence order of prepositional variables to obtain 
compressed representations. We provide a first intr oduction to OBDDs b elow. A more detailed 
exposition and further literature pointers are given by Hu m and RvarJd2000h . 



4.1. Boolean Functions and Operations. We first explain how sets can be represented by means 
of Boolean functions. This will enable us, given a fixed finite base set S , to represent every family 
of sets S c 2 s by a single Boolean function. 

A Boolean function on a set Var of variables is a function <p : 2 Var — > {true, false). The underlying 
intuition is that <p(V) computes the truth value of a Boolean formula based on the assumption that 
exactly the variables of V are set to true. A simple example are the functions florae]] and \false\ 
that map every input to true or false, respectively. Another example are so-called characteristic 
functions of the form §y\ f° r some v 6 Var, which are defined as [vjk^V) := true iff v e V. 
Boolean functions over the same set of variables can be combined and modified in several ways. 
Especially, there are the obvious Boolean operators for negation, conjunction, disjunction, and im- 
plication. By slight abuse of notation, we will use the common (syntactic) operator symbols -i, A, 
V, and — > to also represent such (semantic) operators on Boolean functions. Given, e.g., Boolean 
functions <p and ijj, we find that (tp A if/)(V) = true iff <p(V) = true and iff(V) = true. Note that the 
result of the application of A results in another Boolean function, and is not to be understood as a 
syntactic logical formula. 

Another operation on Boolean functions is existential quantification over a set of variables V Q 
Var, written as 3V.ip for some function tp. Given an input set W c Var of variables, we define 
(3V.(p)(W) = true iff there is some V c V such that tp(V U (W \ V)) = true. In other words, 
there must be a way to set truth values of variables in V such that tp evaluates to true. Universal 
quantification is defined analogously, and we thus have W.cp := -iSV.-xp as usual. Mark that our use 
of 3 and V overloads notation, and should not be confused with role restrictions in DL expressions. 



4.2. Ordered Binary Decision Diagrams. Binary Decision Diagrams (BDDs), intuitively speak- 
ing, are a generalization of decision trees that allows for the reuse of nodes. Structurally, BDDs are 
directed acyclic graphs whose nodes are labeled by variables from some set Var. The only exception 
are two terminal nodes that are labeled by true and false, respectively. Every non-terminal node has 
two outgoing edges, corresponding to the two possible truth values of the variable. 

Definition 4.1. A BDD is a tuple — (N, n roo \, 72true> ft false* low, high, Var, A) where 

• N is a finite set called nodes, 

• " root e N is called the root node, 

• ft true* n false e N are called the terminal nodes, 

• low, high : Af \ {«true^faise} - * N are two child functions assigning to every non-terminal node 
a low and a high child node. Furthermore the graph obtained by iterated application has to be 
acyclic, i.e., for no node n exists a sequence of applications of low and high resulting in n again. 

• Var is a finite set of variables. 

• A : N\{n\ rue , «f a isel — * Var is the labeling function assigning to every non-terminal node a variable 
from Var. 



TYPE-ELIMINATION-BASED REASONING FOR SWQb s 



13 



OBDDs are a particular realization of BDDs where a certain ordering is imposed on variables to 
achieve more efficient representations. We will not require to consider the background of this opti- 
mization in here. Every BDD based on a variable set Var = {x\,...,x n } represents an n-wy Boolean 
function tp : 2 Var — > {true,false}. 

Definition 4.2. Given a BDD O = (N, n r0Q \, rctrue* "false* low, high, Var, A) the Boolean function tpo • 
2 Var — > {true, false) is defined recursively as follows: 

¥>0 := fn, M <Pn ttue = ItrueJ p nfalse - tfalsej 

<-Pn = \-ilMn)\ A ¥>low(")) V A ^high(n)) for n € N \ {n Xrue , Wfaise} 

In other words, the value <p(V) for some V C Var is determined by traversing the BDD, starting from 
the root node: at a node labeled with v e Var, the evaluation proceeds with the node connected by 
the high-edge if v e V, and with the node connected by the low-edge otherwise. If a terminal node 
is reached, its label is returned as a result. 

BDDs for some Boolean formulas might be exponentially large in general (compared to |Var|), 
but often there is a representation which allows for BDDs of manageable size. Finding the opti- 
mal representatio n is NP-complete, but heuristics have shown to yield good approximate solutions 



(Wegener. 120041) . Hence (O)BDDs are often conceived as efficiently compressed representations 
of Boolean functions. In addition, many operations on Boolean functions - such as the aforemen- 
tioned negation, conjunction, disjunction, implication as well as propositional quantification - can 
be performed directly on the corresponding OBDDs by fast algorithms. 



4.3. Translating Dominos into Boolean Functions. To apply the above machinery to DL rea- 
soning, consider a flattened Jl£,CIb TBox T = FLAT(T). A set of propositional variables Var is 
defined as Var := RU(P(T) x {1, 2}). We thus obtain a bijection between dominoes over the set P(T) 
and sets V C Var given by {A,%"B) h-» (A x {1}) U 3? U (23 x {2)). Hence, any Boolean function 
over Var represents a domino set as the collection of all variable sets for which it evaluates to true. 
We can use this observation to rephrase the construction of Dj in Algorithm \T\ into an equivalent 
construction of a function |[T]|. 

We first represent DL concepts C and role expressions U by characteristic Boolean functions over 
Var as follows. 



EC] := 



-iD] if C = -D 

ID] A IE J if C = D n E 

ID] VIE J if C = D U E 

l(C,l)] x ifCeP(7) 



IU] := 



IVJ AlWJ if U = V n W 
WJ VlW] if U = V U W 



if U € R 



We can now define a decision procedure based on Boolean functions, as displayed in Algorithm |2] 
This algorithm is an accurate translation of Algorithm \T\ where the intermediate Boolean functions 
ip kb ,ip ex ,tp um ,^ elex ,ip^ elum ,(p s j ym represent domino sets containing all dominoes satisfying the re- 
spective conditions from Algorithm [T] By computing their conjunction with each other (and, for 
the latter three, with the Boolean function representing the domino set from the previous iteration) 
we intersect the respective domino sets which results in their successive pruning as described in 
Algorithm Q] The algorithm is a correct procedure for checking consistency of J[£jCIb TBoxes 
as unsatisfiability of 7 coincides with |[T| = false. Note that all necessary computation steps can 
indeed be implemented algorithmically: Any Boolean function can be evaluated for a fixed variable 
input V, and equality of two functions can (naively) be checked by comparing the results for all 
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Algorithm 2 Computing the boolean representation |[T| of the canonical domino set Dj of a TBox 
Input: 7 an fUjCIb TBox, 6 = P(FLAT(T)) 

Output: the canonical domino set of T, represented as Boolean function [[TJJ 
1: <p kb := /\ICJ 

Ce7 

2: if™ := /\ IW.C, l)\ A lUJ -» E<C,2>^ 
VC/.CeP(T) 

3: ^ ex /\ [<C, 2>]k, A [t/][ |[<3I/.C, 1)^ 

3U.CeP(7) 

4: |[T]|o := <p kb A ^ uni A ^ ex 

5: i :=0 

6: repeat 
7: i:-i+l 

8: ^f elex := /\ U3U.C, l)J x 3(R U ex{2}).(CT,-_i A [[£/] A 1{C,2)\) 

3U.CeP(7) 

9: pf e,uni := /\ KVU.C, l)\ -> -3(R U ex{2}).(CT,-_i A [[£/] A ->l(C,2)l x ) 

VC/.C€P(T) 

10: ^ ym (V) := CT;_i({<D, 1) I <D, 2)eF|U |Inv(/?) | 7? € V} U |<D, 2> | (D, 1) e V}) 

11: UAi := Uli-l A ^f elex A ^f eluni A ^ 

12: until [[TD, = IfTBj-i 

B: m ■= mi 

14: return [[0J 



possible input sets (which are finitely many since Var is finite). The algorithm terminates since the 
sequence is decreasing w.r.t. {V | ITJJ^V) = true), and since there are only finitely many Boolean 
functions over Var. 

Proposition 4.3. For any 3{£CIb TBox 7 and variable set V e Var as above, we find that ITJJ(V) = 
true iff V represents a domino in B<j as defined in Definition \J} 

Proof. It is easy to see that the Boolean operations used in constructing [[TJ directly correspond to 
the set operations in Definition [T] such that [fTTJ(V) - true iff V represents a domino in Dkb- D 

All required operations and checks are provided by standard OBDD implementations, and thus can 
be realized in practice. 

In the remainder of this section, we illustrate the above algorithm by an extended example to which 
we will also come back to explain the later extensions of the inference algorithm. Therefore, con- 
sider the following J{£CIb knowledge base XH. 

PhDStudent E ~3has .Diploma 

Diploma E Vhas~ .Graduate 

Diploma n Graduate E _L 

Diploma(laureus) PhDStudent(laureus) 

For now, we are only interested in the terminological axioms, the consistency of which we would 
like to establish. As a first transformation step, all TBox axioms are transformed into the following 
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Figure 1: OBDDs arising when processing the terminology of JCB; following traditional BDD no- 
tation, solid arrows indicate high successors, dashed arrows indicate low successors, and 
the topmost node is the root 



universally valid concepts in negation normal form: 

-^PhDStudent U 3has. Diploma -^Diploma U Vhas~ .Graduate -^Diploma U -^Graduate 

The flattening step can be skipped since all concepts are already flat. Now the relevant concept 
expressions for describing dominoes are given by the set 

P(7) = {3has. Diploma, Vhas~. Graduate, Diploma, Graduate, PhDStudent}. 

We thus obtain the following set Var of Boolean variables (although Var is just a set, our presentation 
follows the domino intuition): 



(3has. Diploma, 1) 


has 


(3has. Diploma, 2) 


(V has" .Graduate, 1) 


has~ 


(V has' .Graduate, 2) 


(Diploma, 1) 




(Diploma, 2) 


{Graduate, 1) 




(Graduate, 2) 


(PhDStudent, 1) 




(PhDStudent, 2) 



We are now ready to construct the OBDDs as described. Figure Q] (left) displays an OBDD corre- 
sponding to the following Boolean function: 

<p kb ■- (^(PhDStudent, \)\ V l(3has .Diploma, \)\) 
f\(->^(Diploma, l)J x V |[(V has" .Graduate, l)J x ) 
A(->l{Diploma, 1)J X V ^(Graduate, 
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and Fig. Q] (right) shows the OBDD representing the function |[T]|o obtained from cp by conjunc- 
tively adding 

<P ex - -i^{Diploma, 2) J x V ->^has^ x V ^{3has. Diploma, l)J x and 
ip um = -i^(Vhas~ .Graduate, \)J X V -iflTjas - ]]^ V ^(Graduate, 2)J X . 

Then, after the first iteration of the algorithm, we arrive at an OBDD representing [[Tfli which is 
displayed in Fig. |2] This OBDD turns out to be the final result |[TJ. The input TBox is derived to 
be consistent since there is a path from the root node to 1. 




Figure 2: Final OBDD obtained when processing using notation as in Fig. [TJ arrows to the 
node have been omitted for better readability 
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5. Reasoning with ABox and DL-Safe Rules via Disjunctive Datalog 

The above algorithm does not yet take any assertional information about individuals into account, 
nor does it cover DL-safe rules. The proof of Theorem 13 .7 I hinges upon the fact that the constructed 
domino set D j induces a model of the terminology T, and Lemma 13.41 states that this is indeed 
the greatest model in a certain sense. This provides some first intuition of the problems arising 
when ABoxes are to be added to the knowledge base: J[£jCIb knowledge bases with ABoxes do 
generally not have a greatest model. 



We thus employ disjunctive Datalog (see Eiter et all. 1 1 9971) as a paradigm that allows us to incor 



porate ABoxes into the reasoning process. The basic idea is to forge a Datalog program that - 
depending on two given individuals a and b - describes possible dominoes that may connect a and 
b in models of the knowledge base. There might be various, irreconcilable such dominoes in differ- 
ent models, but disjunctive Datalog supports such choice since it admits multiple minimal models. 
As long as the knowledge base has some model, there is at least one possible domino for every pair 
of individuals (possibly without connecting roles) - only if this is not the case, the Datalog program 
will infer a contradiction. Another reason for choosing disjunctive Datalog is that it allows for the 
straightforward incorporation of DL-safe rules. 

We use the OBDD computed from the terminology as a kind of pre-compiled version of the relevant 
terminological information. ABox information is then considered as an incomplete specification of 
dominoes that must be accepted by the OBDD, and the Datalog program simulates the OBDD's 
evaluation for each of those. 

Definition 5.1. Consider an extended J{£CIb knowledge base KB = (7, IP), and an OBDD O = 
(N, «root> "true, "false* low, high, Var, A) that represents the function |[T]| as defined by Algorifhm[2] A 
disjunctive Datalog program DD(KB) is defined as follows. DD(KB) uses the following predicates: 

• a unary predicate Sc for every concept expression C € P(FLAT(T)), 

• a binary predicate S R for every atomic role R e N Rl 

• a binary predicate A„ for every OBDD node n e N, 

• the equality predicate ~. 

The constants in DD(KB) are the individual names used in IP. The disjunctive Datalog rules of 
DD(KB) are defined as follows0 

(1) For every DL-safe rule B — > H from RB, DD(KB) contains the rule obtained from B — > H by 
replacing all C(x) by Sc(x) and all R(x,y) by S R (x,y). 

(2) DD(KB) contains rules -» A„ root (x,y) and A„ false (jc,y) 

(3) If ?i e ;V with A(n) = (C, 1) then DD(KB) contains rules 

S c (x) A A n (x,y) -» A h ig h( „)(x,y) and A n (x,y) -» A\ ow(n) (x,y) V S c (x). 

(4) IfneN with A(n) = (C, 2) then DD(KB) contains rules 

S c (y) A A n (x,y) -> A high( , l) (x,y) and A n (x,y) -» A low{n) (x,y) v S c (y). 

(5) If n e N with A(n) = R for some R e N R then DD(KB) contains rules 
S R (x,y) AA n (x,y) -> A hi g h („)(x,y) andA„(x,y) -> A| 0wW (x,y) V S R (x,y). 

(6) If n e N with A{ri) = R~ for some R € M R then DD(KB) contains rules 
S R (y, x) A A n (x,y) -> A high(n) (x,y) and A n {x,y) -> A low(n) (x,y) V S R (y, x). 

Note that the arity of predicates in DD(KB) is bounded by 2. Hence, the number of ground atoms 
is quadratic with respect to the number of constants (individual names), whence the worst-case 
complexity for satisfiability checking is NP w.r.t. the number of individuals (and especially w.r.t. 



Note that we use disjunctive Datalog with equality. However, every disjunctive Datalo g program w ith equality can 
be reduced to one without equality in linear time, as equality can be axiomatized (see, e.g., lFittingill996h . 
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the number of facts) , as opposed to the NExpTime complexity of disjunctive Datalog in general 
(IDantsin et all 120011) . Note that, of course, DD(KB) may still be exponential in the size of KB 
in the worst case: DD(KB) is linear in the size of the underlying OBDD which in turn may have 
exponential size compared to the set of propositional variables used in the represented Boolean 
functions. Finally the number of these variables is linearly bounded by the size of KB. It remains 
to show the correctness of the Datalog translation. 

Lemma 5.2. Given an extended JKJiCIb knowledge base KB such that I is a model of KB, there 
is a model ST of DD(KB) such that 

• I \= C(a) iff J N S c (a), 

• I \= R(a, b) iff |= SR(a, b), and 

• I\=a~b iff J \=a~b. 

for any a,b eN;, C £ Nc, and R £ N#. 

Proof. Let KB = (T, CP). We define an interpretation $ of DD(KB). The domain of J~ contains 
the named individuals from I, i.e., = {a 1 \ a e N/}. For individuals a, we set a^ := a 1 . The 
interpretation of predicate symbols is now denned as follows (note that A% is defined inductively 
on the path length from « root to n): 

• 6 e iff 8 e C 1 

• <<5i,<5 2 > eS^iff (8i,8 2 ) £ # J 

• <5 1 ,5 2 )eA^ oot foralU 1 ,5 2 eA^ 

• (81,82) € A„ for n + « root if there is a node n' such that (8\, 62) £ and one of the following 
is the case: 

- A(n') - (C, i), for some i £ {1, 2}, and n = Iow(ti') and Si g C 1 

- A(n') - {C, i), for some i £ {1, 2}, and n = high(«') and 5, e C 1 

- A(n') =Randn = low(n') and (S 1 ,6 2 ) <t R 1 

- A(n') =Randn = high(n') and (Si , 5 2 ) £ R 1 

Mark that, in the last two items, R is any role expression from Var, i.e., a role name or its inverse. 
Also note that due to the acyclicity of O, the interpretation of the A-predicates is indeed well- 
defined. We now show that J is a model of DD(KB). To this end, first note that the extensions of 
predicates Sc and Sr in J were defined to coincide with the extensions of C and R on the named 
individuals of I. Since I satisfies CP, all rules introduced in item (OQ) of Definition 15. H are satisfied by 
J~. The restriction of DL-safe rules to named individuals can be discarded here since A-7 contains 
only named individuals from A J . 

Similarly, we find that the rules of cases ©H© are satisfied by J~. Consider the first rule of (O, 
Sc(x) A A n (x,y) — > Ahigh(«)(^,} ; ), and assume that 8\ e and (61,62) £ A„ . Thus 61 £ C 1 . Using 
the preconditions of and the definition of J~, we conclude that (61,82) £ ^n^ n y The second 
rule of case ([3]> covers the analogous negative case. All other cases can be treated similarly. 
Finally, for case (0, we need to show that A^ a|se = 0. For that, we first explicate the correspondence 
between domain elements of I and sets of variables of O. Given elements 81,82 £ A J we define 
Vs u s 2 := {(C, n)\Ce P(FLAT(T)), 5 n £ C 1 } U {R \ (81, 8 2 ) £ R 1 }, the set of variables corresponding 
to the J-domino between 81 and 82- 

Now A^ a|se = clearly is a consequence of the following claim: for all 81,82 £ A J and all n e N, 
we find that (81,82) £ A„ implies f n (Vs u s 2 ) = true (using the notation of Definition I4.2I ). The proof 
proceeds by induction. For the case n = n roo i, we find that </>„ root = |[TJ. Since V^,^ represents a 
domino of I, the claim thus follows by combining Proposition 14.31 and Lemma [3741 
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For the induction step, let n be a node such that (61 ,62) eA„ follows from the inductive definition of 
J based on some predecessor node n' for which the claim has already been established. Note that n' 
may not be unique. The cases in the definition of J~ must be considered individually. Thus assume 
n' , n, and 6\ satisfy the first case, and that (61,62) £ A n . By induction hypothesis, <Pn'(Vg l j 2 ) = true, 
and by Definition 14. 2 1 the given case yields <p n (Vs u $ 2 ) = true as well. The other cases are similar. □ 

Lemma 5.3. Given an JlUCIb knowledge base KB such that fj is a model o/DD(KB), there is a 
model I of KB such that 

• I \= C(a) iff J \= S c (a), 

• I \= R(a, b) iff J \= S R {a, b), and 

• 1 1= a » b iff J \=a~b, 

for any a,b £ N/, Ce Nc, andR £ N#. 

Proof. Let KB = (T, J 3 ). We construct an interpretation I whose domain A J consists of all se- 
quences starting with an individual name followed by a (possibly empty) sequence of dominoes 
from Dj such that, for every cr e A J , 

• if o- begins with a(A, ft, 2>, then {C | C e P(FLAT(T)), a J £ S%] = A, and 

• if cr contains subsequent letters (A, ft, S> and (A', ft', £'), then "B = A'. 

For a sequence cr - a(A\, ( 3i\,A2)(A2, 3?2>^3) ■ ■ ■ (A-i, ftj-i, A), we define tail(cr) := .A,-, whereas 
for a o- - a we define tail(cr) := {C | C € P(FLAT(T)), a J e S^\. Now the mappings of I are 
defined as follows: 

• for a € N/, we have a 1 := a, 

• for A e N c , we have cr £ A J iff A £ tail(cr), 

• for /? € N#, we have (o-\, o~ 2 ) £ 7? J if one of the following holds 

- cf\ - a £ N/ and o~ 2 = b £ N/ and (a, b) £ 5^, or 

- cr 2 = tri<A, ft, 2> with /? £ ft, or 

- 0-1 = o- 2 <.A, ft, 2> with Inv(7?) £ ft. 

Thus, intuitively, I is constructed by extracting the named individuals as well their concept (and 
mutual role) memberships from J~, and appending an appropriate domino-constructed tree model 
to each of those named individuals. We proceed by showing that I is indeed a model of KB. 
First note that the definition of I ensures that, for all individual names a, b £ N/, we indeed have 
1 1= C(a) iff J |= S c (a), I \= R(a, b) iff J |= S*(a, ft), and J |= a « ft iff J" |= a « ft. Therefore, the 
validity of the rules introduced via case (Q]) ensures that J is a model of ft. 

For showing that the TBox is also satisfied, we begin with the following auxiliary observation: for 
every two individual names a, ft £ N/, and ft aft := {R | (a J ,b J > £ S%\ U {Inv(rt) | <ft J ,a^ > £ }, 
the domino (tail(a), ft a fo, tail(ft)) is contained in ~Dj (Claim f). Using Proposition 14.31 it suffices to 
show that the Boolean function |[T] if applied to V a ,b '■- {tail(a) x {1} U R a b U tail(ft) x {2}} yields 
true. Since |[T]| = <p„ root , this is obtained by showing the following: for any a, ft £ N/, we find 
that (aP ' ,b&) £ A„ implies <p n (V a ,b) = true. Indeed, (t) follows since we have (a?,b^) £ A^ mo{ 
due to the first rule of © in Definition 15.11 We proceed by induction, starting at the leafs of the 
OBDD. The case (a, ft) £ Al is immediate, and (a, ft) £ Al is excluded by the second rule of 

x ' ' "true ' \ > / ifalse j 

([2]). For the induction step, consider nodes n, n' £ N such that either A(n) £ V a ,b and n' = high(«), 
or A(n) £ V a ,b and n' = Iow(ti). We assume that (a^, bP) £ A„ , and, by induction, that the claim 
holds for n' . If A n = (C, 1), then one of the rules of case ([3]> applies to a^ and bP . In both cases, we 
can infer (a? ,b&) £ A~\ and hence tp n '(V a ,b) = true. Together with the assumptions for this case, 
Definition l4.2l implies <p n (V a jb) = true, as required. The other cases are analogous. This shows (f). 
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Now we can proceed to show that all individuals of I are contained in the extension of each concept 
expression of FLAT(T). To this end, we first show that cr e C 1 iff C e tail(cr) for all C € P(FLAT(T)). 
If C € He is atomic, this follows directly from the definition of I. The remaining cases that may 
occur in P(FLAT(T)) are C - SUA and C = Vc/A. 

First consider the case C = 3U.A and assume that cr e C 1 . Thus there is cr' e A J with (cr, cr') e U 1 
and cr' e A 1 . The construction of the domino model admits three possible cases: 

• cr,cr' e N/ and 3cVo-' h U and A € tail(cr'). Now by (f), the domino <tail(cr), 3cVo-',tail(cr')) 
satisfies condition ex of Algorithm [Q and thus C e tail(cr) as required. 

• cr' = cr(tail(cr), % tail(cr')) with 31 h U and A £ tail(cr'). Since D T £ D , we find that 
(tail(cr), 31, tail(cr')) satisfies condition ex, and thus C e tail(cr) as required. 

• cr - cr'(tail(cr'), 31, tail(cr)) with Inv(IR) h t7 and A e tail(cr'). By condition sym, B<j contains the 
domino (tail(cr), Inv(3l), tail(cr')), and again we use ex to conclude C e tail(cr). 

For the converse, assume that 3U.A e tail(cr). So Dj contains a domino (A, 31, tail(cr)). This is 
obvious if the sequence cr ends with a domino. If cr = a e N/, then it follows by applying (f) to 
a with the first individual being arbitrary. By sym also contains the domino (tail(cr), 31, A). By 
condition delex, the latter implies that D-j contains a domino (tail(cr), 31', A') such that 31' h U and 
A € A'. Thus cr' = cr(taH(cr),Ol',A') is an J-individual such that (cr,cr') e U 1 and cr' e A J , and 
we obtain cr e (3t/.A) J as claimed. 

For the second case, consider C = VC/.A and assume that cr e C J . As above, we find that Dg- 
contains some domino (A, 3c, tail(cr)), where (f) is needed if cr e N/. By sym we find a domino 
(tail(cr), 31, A). For a contradiction, suppose that Vf/.A £ tail(cr). By condition deluni, the latter 
implies that I}j contains a domino (tail(cr), 31', A') such that 31' \- U and A £ .A'. Thus cr' = 
cr^aiKcr)^',^') is an J-individual such that (cr,cr') e U 1 and cr' g A J . But then cr g (Vt/.A) J , 
which is the required contradiction. 

For the other direction, assume that V£/A e tail(cr). According to the construction of I, for all 
elements cr' with (cr, cr') e £/ J , there are three possible cases: 

• cr, cr' e N/ and 3cVo-' i- £J Now by (t), the domino (tail(cr), 3l cro ./,tail(o" / )) satisfies condition uni, 
whence A e tail(cr'). 

• cr' - cr<tail(cr), % tail(cr')) with 31 \- U. Since D T c D , (tail(cr), 3c, tail(cr')) must satisfy condi- 
tion uni, and thus A e tail(cr'). 

• cr - cr'(tail(cr'),3l, tail(cr)) with Inv(3l) h U. By condition sym, D7 also contains the domino 
(tail(cr), Inv(3l), tail(cr')), and we can again use uni to conclude A £ tail(cr'). 

Thus, A e tail (cr') for all {/-successors cr' of cr, and hence cr e (Vc/.A) J as claimed. 
To finish the proof, note that any domino (A, 3c, 33) € Dj satisfies condition kb. Using sym, we 
have that for any cr e A 1 , the axiom rirj e taii(o-) D E C is a tautology for all C € FLAT(T). As shown 
above, cr e D 1 for all D e tail(cr), and thus cr e C 1 . Hence every individual of I is an instance of 
each concept of FLAT(T) as required. □ 

Lemmas [5.21 and [531 give rise to the following theorem which finishes the technical development of 
this section by showing that DD(KB) faithfully captures both positive and negative ground conclu- 
sions of KB, and in particular that DD(KB) and KB are equisatisfiable. 

Theorem 5.4. For every extended 3\JJ2Ib knowledge base KB hold 

• KB and DD(KB) are equisatisfiable, 

• KB |= C(a) zJDD(KB) |= S c (a), 

• KB |= R(a,b) jJDD(KB) |= S R (a,b), and 

• KB |= a « b iff DD(KB) |= a » b, 
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for any a,b e N/, C € Nc, andR e Nr. 

Proof. Immediate from Lemma [5T21 and Lemma [531 □ 

Coming back to our example knowledge base %'B from Section HJ the corresponding disjunctive 
Datalog program DD(3CB) contains 70 rules: two rules for each of the 33 labeled nodes from the 
OBDD displayed in Fig. (2], the two rules — > A nrM (x,y) and A„ false (;e, y) — > as well as the two rules — > 
SDipioma(laureus) and — > SphDStudent(laureus) introduced by conceiving the two ABox statements as 
DL-safe rules and translating them accordingly. The program turns out to be unsatisfiable, witnessed 
by the unsatisfiable subprogram displayed in Fig. [3] 



A$(x,y) A Sshas.Diplomaix) 
A { (x,y) A Sp hD Student{x) 
M{x,y) A S\/ has- .Graduated) 

A%{x,y) A S Graduate (y) 
A 9 (x,y) A S Graduate (y) 

An(x,y) A S Diplomat) 

A u (x,y) A S Dip i oma (y) 

Al(,{x,y) A S^has. Diplomat) 
A2t)(x,y) A SphDStudentiy) 
Aia\se(x,y) 



S Dip i oma (laureus) 
Ao(x,y) 

A 5 {x,y) A () {x,y) 
A\ a \s e (x,y) 

A 9 (x,y) A 5 (x,y) 

An(x,y) As{x,y) 

A l3 (x,y) A 9 (x,y) 
Afa\se(x,y) 
Afa\se(x,y) 

Af a \se(x,y) A l6 (x,y) 
Afa\se(x,y) 



SphDStudent(laureus) 

A\{X,y) V S^has.Diplomaix) 

A%{x,y) V S\/has- .Graduated) 
An(x,y) V S Gra duateiy) 

A 16 (x,y) V S Graduate (y) 

A2(){x,y) V Sshas.Diplomaiy) 



Figure 3: Unsatisfiable subprogram of DD(XB) witnessing unsatisfiability of %"B 



6. Polynomial Transformation from S ( HIQb s to JUlGIb 

In this section, we present a stepwise satisfiability-preserving transformation from the description 
logic S'T-iIQb s to the more restricted S\£jCIb. This transformation is necessary as our type- 
elimination method applies directly only to the latter. 



6.1. Unravelings. For our further considerations, we will use a well-known model transformation 
technique which will come handy for showing equisatisfiability of knowledge base transf ormations 



introd uced later on (for an introductory account on unravelings in a DL setting cf., e.g.. iRudolph 



( 201ll V). Essentially, the transformation takes an arbitrary model of a SI-lIQbs knowledge base and 



converts it into a model that is "tree-like". We start with some preliminary definitions. The first one 
exploits that role subsumption on non-simple roles can be decided by an easy syntactic check that 
takes only role hierarchy axioms into account. 

Definition 6.1. Based on a fixed S'HIQbs knowledge base KB, we define E* as the smallest binary 
relation on the non-simple atomic roles R n such that: 

• RE;* R for every atomic role R, 

• Re* S and Inv(7?) E* Inv(,S) for every RBox axiom Re S , and 

• Re* T whenever Re* S and S E* T for some atomic role S . 
Furthermore, we write Re* S whenever Re* S and S R. 
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The next definition introduces a standard model transformation techniqu e that is often used t o show 



variants of the tree model property of a logic. We adopt the definition of iGlimm et alj (120071) . 



Definition 6.2. Let KB be a consistent extended SI-UQbs knowledge base, and let I = (A J , J ) be 
a model for KB. 

The unraveling of I is an interpretation that is obtained from I as follows. We define the set 
S c (A J )* of sequences to be the smallest set such that 

• for every a e N/, a 1 is a sequence; 

• 8\ ■ ■ ■ 5 n ■ 8 n+ \ is a sequence, if 

- 5\ • • • 5 n is a sequence, 

- 6{+i + for all i = 2 n, 

- (6 n , (5„+i> e R 1 for some R e N R . 

For each cr = 6i • ■ • 8 n e S , set last(cr) := 8 n . Now, we define the unraveling of I as the interpretation 
3 - (A^ , -3) with = S and we define the interpretation of concept and role names as follows 
(where cr, cr' e A^ are arbitrary sequences in A-^): 

(a) for each a € N/, set := a 1 ; 

(b) for each concept name A e Nc, set cr e iff last(cr) e A J ; 

(c) for each role name R e N«, set (cr, cr') e /f^ iff 

• cr' = crd for some 5e A 1 and (last(cr), last(cr')) e 7? J or 

• cr = cr'5 for some 5e A 1 and (last(cr), last(cr')) e 7? J or 

• cr - a J , cr' = b 1 for some d,kN/ and (a 1 , b 1 ) e 7? J . 

Unraveling a model of an extended S1-lIQb s knowledge base results in an interpretation that still 
satisfies most of the knowledge base's axioms, except for transitivity axioms. The following defini- 
tion provides a "repair strategy" for unravelings such that also t he transitivity conditions are again 



satisfied. The presented definition is inspired by a similar one by iMotikJ ((2006). 



Definition 6.3. Given an interpretation I and a knowledge base KB, we define the completion of I 
with respect to KB as the new interpretation J~ = (A^, as follows: 

• A^ := A J , 

• a.3 :— a 1 for every a eNj, 

• A J := A 1 for every A e N c , 

• for all simple roles R, we set := R 1 , 

• for all non-simple roles R, R^ is set to the transitive closure of R 1 if Tra(/?) e KB, otherwise 
Rj - Rl u U Sc .r with Tra(5 )e KB or Tra(inv(5)) e KB( 5 J )*> where ( S ^* denotes the transitive closure 
ofS 1 . 

Having the above tools at hand, we are now ready to show that unraveling and subsequently com- 
pleting a model of an extended knowledge base will result in a model. This correspondence will be 
helpful for showing the completeness of the knowledge base transformation steps introduced below. 

Lemma 6.4. Let KB be an extended S'HIQb s knowledge base and let I be a model of KB. More- 
over, let $ be the unraveling of I and let 7C be the completion of \J~. Then the following hold: 

(1) J satisfies all axioms of KB that are not transitivity axioms. 

(2) For all sequences (T\,o~i, . . . , o~ n -\,o~ n with n > 3 and (cr,-, cr i+ i) 6 for 1 < i < n, and where 
cr\,cr n € {a? | a € N/} and cr 2 , . . . , cr n -\ $ {a^ \ a € Nj}, we have o~\ - cr n and cr 2 - cr n -\. 

(3) 7C is a model of KB. 
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Proof. For the first claim, we investigate all the possible axiom types. First, as I and J~ coincide 
w.r.t. concept and role memberships of all named individuals (i.e., individuals o~ for which o~ = a 1 
for some a e N/), they satisfy the same DL-safe rules. 

For role hierarchy axioms U E V with U, V restricted, suppose for a contradiction that $ does not 
satisfy U E V, i.e., that there are two elements cr, cr' £ A^ such that (cr, cr') e but (cr, cr') V^. 
As U is restricted, either both cr and cr' are named individuals or cr' = crS or cr = cr'c>. Therefore 
we know that (last(cr), last(cr')) € U 1 but (last(cr), last(cr')) £ V 1 which would violate U E V and 
hence, gives a contradiction. 

Next, we consider TBox axioms (remember that we assume them to be normalized into axioms 
TEC with C in negation normal form). By induction on the role depth, we will show that for 
every concept D it holds that cr e iff last(cr) e D 1 . The satisfaction of T E C in J~ then directly 
follows via A J = {cr e A J | last(cr) e A 1 } = {cr £ A J | last(cr) £ C 1 } = C J . 

As base case, note that for D £ Nc, the claim follows by definition, while for D = T and D = ± 
the claim trivially holds. For the induction steps, note that (i) the claimed correspondence trans- 
fers immediately from concepts to their Boolean combinations and (ii) that for every cr £ A^, the 
function last(-) gives rise to an isomorphism tp between the neighborhood of cr in $ and the neigh- 
borhood of last(cr) in I. More precisely, (p maps {cr' £ A^ \ (cr,cr') € for some/? e R} to 
{6' € A 1 | (last(cr), 5') e R 1 for some ReR] such that (cr, cr') e S J iff <last(cr), tp(cr')) e S 1 for all 
roles 5 e N^as well as cr' e iff (£>(<x') e E 1 for concepts £ that have a smaller role depth than D 
(by induction hypothesis). Thereby, the claimed correspondence transfers to existential, universal, 
and cardinality restrictions as well. 

For the second claim, we observe that by the definition of the unraveling, no individual cr - 6\ . . .8k 
can be directly connected by some role to an individual cr' - 5' { . . . 5\ with 8\ + 5' { unless k = I - 1 
in which case both individuals would be named by construction. On the other hand, every role chain 
starting from some named individual 6 and not containing any other named individual contains only 
individuals of the form 5w with w € (A J )*. Thus, we conclude that cr\ - cr n . Now, suppose 
cr 2 ± cr n -\. By construction we have cr 2 - cr\5 and cr n -\ = cr n 5' - cr\5' with 8 + 6'. However, 
then by construction, every role path from cr 2 to cr n -\ must contain cr\ which is named and hence 
contradicts the assumption. Therefore cr 2 = cr n -\. 

Considering the third claim, we easily find that all transitivity axioms as well as role hierarchy 
statements are satisfied by construction. For the TBox axioms, the argumentation is similar to 
the one used to prove the first claim but it has to be extended by the following observation: By 
construction, for all new role instances (cr,cr') e R^ \ R^ introduced by the completion, there is 
already a cr* with (cr,cr*) £ R J such that (cr,cr*) £ S J iff (cr,cr') e S 1 for all roles S £ N R as 
well as cr* £ E^ iff cr' £ E 1 for concepts E. Therefore (and since non-simple roles are forbidden 
in cardinality constraints) the concept extensions do not change in 7f compared to Finally, the 
DL-safe rules are valid: Due to the first claim they hold in J~. Then, they also hold in 7f since, by 
construction and J~ coincide when restricted to named individuals. In order to see the latter, note 
that J also coincides with I w.r.t. named individuals and I satisfies all transitivity axioms, thus the 
completion does not introduce new role instances, as far as named individuals are concerned. □ 



6.2. From S'HI Qb, to JjZCH IQb. As observed bv lRudolph et all d2008ah . a slight generaliza- 
tion of results by iMotik (2006) yields that any S'HIQbs knowledge base KB can be transformed 
into an equisatisfiable ZRHCHIQb knowledge base. For the case of extended knowledge bases, this 
transformation has to be adapted in order to correctly treat the entailment of ground facts R(a, b) for 
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non-simple roles R via transitivity. We start by defining this modified transformation, whereby the 
ground fact entailment is taken care of by appropriate DL-safe rules. 

Definition 6.5. Let c/(KB) denote the smallest set of concept expressions where 

• NNF(-.C U D) e c/(KB) for any TBox axiom C E D, 

• D e c/(KB) for every subexpression D of some concept C e c/(KB), 

• NNF(-.C) E c/(KB) for any ^nR.C e c/(KB), 

• VS.C e c/(KB) whenever Tra(S) e KB and S E* R for a role /? with WR.C e c/(KB). 

Finally, let ©^(KB) denote the extended knowledge base obtained from KB by removing all transi- 
tivity axioms Tra(/?) and 

• adding the axiom VR.C E VR.QJR.C) to KB whenever VR.C e c/(KB), 

• adding the axiom 3(R n R~).T E Self R to KB, where Self R is a fresh concept, 

• adding the DL-safe rules Self R (x) — > R(x, x) and /?(.x, y), R(y, z) — > z) to KB. 

Note that the knowledge base translation defined by ©s can be done in polynomial time. We now 
show that the defined transformation works as expected, making use of the mod el trans formation 
techniques established in the previous section. Parts of the proof are adopted from lMotik d2006h . 



Proposition 6.6. Let KB be an extended S1-lIQb s knowledge base. Then KB and ©s(KB) are 
equisatisfiable. 

Proof. Obviously, every model I of KB is a model of ©^(KB) if we additionally stipulate Self R := 
{6\(6,6)eR 1 }. 

For the other direction, let 7f be a model of ©^(KB). Let now I be the unraveling of 7f and let J~ 
be the completion of I w.r.t. KB. As ©^(KB) does not contain any transitivity statements, we know 
by Lemma l6"4l (l) that J is a model of ©s(KB) as well. 

As a direct consequence of the definition of the completion, note that for all simple roles V we have 
y3 = V 1 (fact f). 

We now prove that $ is a model of KB by considering all axioms, starting with the RBox. Every 
transitivity axiom of KB is obviously satisfied by the definition of J. Moreover, every role inclusion 
FEW axiom is also satisfied: 

If both V and W are Boolean role expressions (which by definition contain only simple roles) this 
is a trivial consequence of (f). If V is a Boolean role expression and W is a non-simple role, this 
follows from (f ) and the fact that, by construction of 3", we have R 1 c for every non-simple role 
R. As a remaining case, assume that both V and W are non-simple roles. If W is not transitive, this 
follows directly from the definition, otherwise we can conclude it from the fact that the transitive 
closure is a monotone operation w.r.t. set inclusion. 

We proceed by examining the concept expressions C E c/(KB) and show via structural induction 
that C 1 £ . As base case, for every concept of the form A or -u4 for A e Nc this claim follows 
directly from the definition of J~. We proceed with the induction steps for all possible forms of a 
complex concept C (mark that all C £ c/(KB) are in negation normal form): 

• Clearly, if D| c D~ and E by induction hypothesis, we can directly conclude {D\ n 
D 2 ) J £ (Di n D 2 ) J as well as (D { U D 2 ) J £ (D\ U D 2 ) J . 

• Likewise, as we have V 1 £ for all simple role expressions and non-simple roles V and again 
D 1 £ due to the induction hypothesis, we can conclude (3V.D) 1 £ (3V.D)^ as well as 
(>n V.D) 1 £ (>n V.D) J . 

• Now, consider C = VV.D. If V is a simple role expression, we know that = V 1 , whence we 
can derive (W.D) J £ (W.D) J from the induction hypothesis. 
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It remains to consider the case C = V/?.D for non-simple roles R. Assume cr e QJR.D) f. If 
there is no cr' with (cr, cr') e R^ , then cr € (WR.D)^ is trivially true. Now assume there are such 
cr'. For each of them, we can distinguish two cases: 

- (cr, cr') € /? J , implying cr' € D J and, via the induction hypothesis, cr' e LP, 

- (cr, cr') i R 1 . Yet, by construction of J~, this means that there is a role 5 with 5 E* R and 
Tra(5 ) e KB and a sequence cr - cr , . . . ,cr n - cr' with (cr^, cr^ > € S 1 for all < k < n. Then 
cr e (VR.D) 1 implies cr e (V5.D) J , and hence e D J . By Definition O ©.s(KB) contains 
the axiom VS.D E VS.(VS.D), and hence en e (VS.D) J . Continuing this simple induction, we 
find that cr* e D J for all & = 1, . . . ,n including cr,, = cr'. 

So we can conclude that for all such cr' we have cr' e D 1 . Via the induction hypothesis follows 

cr € Lfl and hence we can conclude cr e (WR.D)^. 
• Finally, consider C = ^nR.D and assume cr e (^nR.D) 1 . From the fact that R must be simple 

follows R3 = R 1 . Moreover, since both D and NNF(-iD) are contained in c/(KB) the induction 

hypothesis gives = D 1 . Those two facts together imply cr e (^nR.D) 1 . 
Now considering an arbitrary KB TBox axiom C E D, we find NNF(-iC UD) 1 = A 1 as I is a model 
of KB. Moreover - by the correspondence just shown - we have NNF(-.C U D) 1 c NNF(-iC U D) J 
and hence also NNF(->C U D)^ = making CEflan axiom satisfied in J~. 
For showing that all DL-safe rules from KB are satisfied, we will prove that I and J~ coincide 
on the satisfaction of all ground atoms - satisfaction of KB in J then follows from satisfaction of 
KB in I. By construction, this is obviously the case for all atoms of the shape a « b, C(a) and 
R(a,b) for a,b e N/, C e Nc and 7? e N# simple. Moreover we have that J~ \= R(a,b) whenever 
I \= R(a, b). To settle the other direction, suppose R non-simple and J (= R(a, b) but I ^ R(a, b). 
But then, there must be a role S E* R that is declared transitive and satisfies J~ \= S(a,b) but 
I fc£ S(a,b). Let us assume that S is a minimal such role w.r.t. E*. Then, by construction, there 
must be a sequence a 1 = cri,cr 2 , . . . , cr^-i,^ = b 1 with {o-j,crj + i) e S 1 . This sequence can be 
split into subsequences at elements of for which there is a o ( - € N/, i.e., at named individuals, 
leaving us with subsequences (i) of subsequent named individuals of, of j or (ii) of the shape of = 
o~i,i,cr i2 , - . - , cr in _\,cri n - of x with cr, 2, ■ ■ ■ , cr unnamed individuals. For case (ii), Lemma l6~4l 
(2) guarantees of - of +l and cr ( 2 = cr,-,,^, which implies of € (3(R n /? _ ).T) J . Then, due to 
the according axiom 3(R n /?~).T E Self R in 0s(KB), we obtain of e and by the DL-safe 

rule Self R (x) — > x) we have (of, of) 6 Hence, we know that /?(o ; -, o;+i) holds in J for 
all our subsequences of ...of +1 . But then, a (possibly iterated) application of the DL-safe rule 
7?(x, y) A 7?(y, z) — » z) also yields that R(a, b) is valid in I, contradicting our assumption. This 
finishes the proof. □ 



6.3. From ftLCHIQb to JKWHIb^. We now show how any extended JKLCHIQb knowledge 
base KB can be transformed into an extended JKHCHIb^ knowledge base 0j.(KB). The difference 
between the two DLs is that the latter does not allow > number restrictions. This transformation 
(as well as the one presented in Section [631 ) makes use of the Boolean role constructors and differs 
conceptually and t echnically from another m e thod f or removing qualified number restrictions from 
DLs described by lDeGiacomo and Lenzerinil (1 1994) . 

Given an J{£CHIQb knowledge base KB, the WZCHIb^ knowledge base 0>(KB) is obtained by 
first flattening KB and then iteratively applying the following procedure to FLAT(KB), terminating 
if no > restrictions are left: 

• Choose an occurrence of U.A in the knowledge base. 
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• Substitute this occurrence by 3R\.A n . . . n 3R n .A, where /?i, ...,/?„ are fresh role names. 

• For every i e {1, . . . ,n}, add /?, E U to the knowledge base's RBox. 

• For every 1 < i < k < n, add V(/? ( - n /?&)■ J- to the knowledge base. 

Observe that this transformation can be done in polynomial time, assuming a unary encoding of the 
numbers n. It remains to show that KB and @>(KB) are indeed equisatisfiable. 

Lemma 6.7. Let KB be an extended JLCCHTQb knowledge base. Then we have that the extended 
^RJLCHIb^ knowledge base 0^(KB) and KB are equisatisfiable. 

Proof. First we prove that every model of @^(KB) is a model of KB. We do so by an inductive 
argument, showing that no additional models can be introduced in any substitution step of the above 
conversion procedure. Hence, assume KB" is an intermediate knowledge base that has a model 
I, and that is obtained from KB' by eliminating the occurrence of ^n U.A as described above. 
Considering KB", we find due to the KB" axioms V(7?, n Rk)~L that no two individuals 6,6' e A 1 
can be connected by more than one of the roles R\, . . . ,R n . In particular, this enforces 6' ± 6", 

whenever (6,6') e R 1 and (6,6") e R 1 for distinct R, and /?,. Now consider an arbitrary 6 e 

i j 

(3i?i.A n .. . n 3R n .A) 1 . This ensures the existence of individuals 6\,. . .,6 n with (6,6i) e R. and 
6j € A 1 for 1 < i < n. By the above observation, all such 6j are pairwise distinct. Moreover, 
the axioms /?, E U ensure (6, <5 ; ) e U 1 for all i, hence we find that 6 e (^n U.A) 1 . So we know 
(37? i. A n . . . n 3R I1 .A) 1 c (>?i U.Cy . From the fact that both of those concept expressions occur 
outside any negation or quantifier scope (as the transformation starts with a flattened knowledge 
base and does not itself introduce such nestings) in axioms D" e KB" and D' e KB' which are 
equal up to the substituted occurrence, we can derive that D" 1 c D' 1 . Then, from D" 1 = A 1 
follows D' 1 = A 1 making D' valid in I. Apart from D', all other axioms from KB' coincide with 
those from KB" and hence are naturally satisfied in I. So we find that J is a model of KB'. 
At the end of our inductive chain, we finally arrive at FLAT(KB) which is equisatisfiable to KB by 
Proposition 12.31 

Second, we show that 0^(KB) has a model if KB has. By Proposition [231 satisfiability of KB entails 
the existence of a model of FLAT(KB). Moreover, every model of FLAT(KB) can be transformed to 
a model of ©^(KB), as we will show using the same inductive strategy as above by doing iterated 
model transformations following the syntactic knowledge base conversions. Again, assume KB" 
is an intermediate knowledge base obtained from KB' by eliminating the occurrence of ^n U.A as 
described above, and suppose J is a model of KB'. Based on I, we now (nondeterministically) 
construct an interpretation J~ as follows: 

• A^ := A 1 , 

• for all C £ N c , let C J := C 1 , 

• for all 5 6 N R \ {/?,■ | 1 < i < n\, let S J := S 1 , 

• for every 6 € (^n U.A) 1 , choose pairwise distinct ef, . . . , with (6, e^) £ U 1 and e A 1 (their 
existence being ensured by 6's aforementioned concept membership) and let R? := {(6, e 3 ) | 6 e 
(>n U.A) 1 }. 

Now, it is easy to see that J satisfies all newly introduced axioms of the shape V(/?,- n Rt)--L, as the 
ef have been chosen to be distinct for every 6. Moreover the axioms /?,• E U are obviously satisfied 
by construction. Finally, for all <5 e 0*n U.A) 1 the construction ensures 6 e {3R\.A n . . . n 3R n .A)^ 
witnessed by the respective ef. So we have (^n U.A) 1 c (3/?i.A n . . . n 3R n .A)3 ' . Now, again 
exploiting the fact that both of those concept expressions occur in negation normalized universal 
concept axioms D' e KB' and D" e KB" that are equal up to the substituted occurrence, we 
can derive that D' 1 c D" J . Then, from D' 1 = A 1 follows D" J = A J making D" valid in J. 
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Apart from D' (and the newly introduced axioms considered above), all other axioms from KB" 
coincide with those from KB' and hence are satisfied in J~, as they do not depend on the R[ whose 
interpretations are the only ones changed in J~ compared to J. So we find that J is a model of 
KB". ' ' □ 



6.4. From ^R£jCHIb^ to JiJLCIb^. In the presence of restricted role expressions, role subsump- 
tion axioms can be easily transformed into TBox axioms, as the subsequent lemma shows. This 
allows to dispense with role hierarchies in ZRHCHIb^ thereby restricting it to JU^CIb**. 

Lemma 6.8. For any two restricted role expressions U and V, the RBox axiom U E V and the TBox 
axiom V(t/ n ->V).J- are equivalent. 

Proof. By the semantics' definition, U E V holds in an interpretation I exactly if for every two 
individuals 5, 5' with (5, 8') e U 1 it also holds that (6, 8') e V 1 . This in turn is the case if and only if 
there are no 6, 5' with (6, 5') e U 1 but (6, 5') £ V T (the latter being expressible as (6, 6') e (-. V) 1 ). 
This condition can be formulated as (U n ->V) 1 - 0, which is equivalent to V(£/ n -> V).±. □ 

Note that U n -iV is restricted (hence an admissible role expression) whenever U is - this can be 
seen from the fact that ¥■ U implies ¥■ U n ->V due to the definition of I- and the Boolean role 
operator n. Consequently, for any extended JU^CHIb^ knowledge base KB, let @^(KB) denote 
the Jmcib 4, knowledge base obtained by substituting every RBox axiom U E V by the TBox 
axiom V(U n -iV)._L. The above lemma assures equivalence of KB and @^(KB) (and hence also 
their equisatisfiability). Obviously, this reduction can be done in linear time. 

6.5. From JiX.CIb^ to 3\££ITb. The elimination of the < concept descriptions from an ex- 
tended JillCIb^ knowledge base is more intricate than the previously described transformations. 
Thus, to simplify our subsequent presentation, we assume that all Boolean role expressions U oc- 
curring in concept expressions of the shape <,n U.C are atomic, i.e. U e R. This can be easily 
achieved by introducing a new role name Ry and substituting U.C by <«/?j/.C as well as adding 
the two TBox axioms V(t/ n -i/?j/).± and V(— iZ7 n Ru). ± (this ensures that the interpretations of U 
and Ru always coincide). 

To further make the presentation more conceivable, we subdivide it into two steps: first we eliminate 
concept expressions of the shape <,nR.C merely leaving axioms of the form <1/?.T (also known 
as role functionality statements) as the only occurrences of number restrictions, hence obtaining an 
JiLCITb knowledge base0 Then, in a second step discussed in the next section, we eliminate all 
occurrences of axioms of the shape <1 R.T. 

Let KB an 3l£CZb< knowledge base. We obtain the tRLCITb knowledge base 0<(KB) by first 
flattening KB and then successively applying the following steps (stopping when no further such 
occurrence is left): 

• Choose an occurrence of the shape <jiRA which is not a functionality axiom <1 R.T, 

• substitute this occurrence by V(7? n n . . . n -iR„).-iA where R\, . . . ,R n are fresh role names, 

• for every i e {1, . . . ,n}, add V/?,-.A as well as <1 Rj.T to the knowledge base. 

This transformation can clearly be done in polynomial time, again assuming a unary encoding of 
the number n. We now show that this conversion yields an equisatisfiable extended knowledge base. 
Structurally, the proof is similar to that of Lemma 16771 



Following the notational convention, we use T to indicate the modeling feature of role functionality. 
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Lemma 6.9. Given an extended JiXjCIb^ knowledge base KB, the extended jftJLCI'Fb knowledge 
base @^(KB) and KB are equisatisfiable. 

Proof. KB and FLAT(KB) are equisatisfiable by Proposition 12.31 so it remains to show equisatisfia- 
bility of FLAT(KB) and ©^(KB). 

First, we prove that every model of ©^(KB) is a model of FLAT(KB). We do so in an inductive 
way by showing that no additional models can be introduced in any substitution step of the above 
conversion procedure. Hence, assume KB" is an intermediate knowledge base with model I, and 
that is obtained from KB' by eliminating the occurrence of <ji R.A as described above. Now consider 
an arbitrary 6 e (V(7? n ->Ri n . . . n -i/?„).-A) J . This ensures that whenever an individual 6' e A 1 
satisfies (6,6') € R 1 and 6' e A, it must additionally satisfy (6,6') € Rj for one i € {\,...,n}. 
However, it follows from the KB"-axioms <1/?,.T that there is at most one such 6' for each R(. 
Thus, there can be at most n individuals 6' with (6, 6') e R 1 and 6' e A. This implies 6 e (^nRA) 1 . 
So we have (V(7? n ->Ri n . . . n -•/?„). -A) J c (^nRA) 1 . Due to the flattened knowledge base 
structure, both of those concept expressions occur outside the scope of any negation or quantifier 
within axioms D" e KB" and D' e KB' that are equal up to the substituted occurrence. Hence, we 
can derive that D" 1 c D' 1 . Then, from D" 1 = A 1 follows D' 1 = A 1 making D' valid in I. Apart 
from D' , all other axioms from KB' are contained in KB" and hence are naturally satisfied in I. So 
we find that J is a model of KB' as well. 

Second, we show that every model of FLAT(KB) can be transformed to a model of ©^(KB). We use 
the same induction strategy as above by doing iterated model transformations following the syntactic 
knowledge base conversions. Again, assume KB" is an intermediate knowledge base obtained from 
KB' by eliminating the occurrence of a ^nR.C as described above, and suppose J is a model of 
KB'. Based on I, we now (nondeterministically) construct an interpretation J~ as follows: 

• := A 1 , 

• for all C e N c , let C J := C 1 , 

• for all 5 e N R \ {R { \ 1 < i < n], let S J := S J , 

• for every 6 € (^nRA) 1 , let e 6 r . . . , ef be an exhaustive enumeration (with arbitrary but fixed 
order) of all those e e A 1 with (6, e) e R 1 and e e A 1 . Thereby 5's aforementioned concept 
membership ensures k <n. Now, let R J := {(6, e 6 ) | 6 € (^nRA) 1 }. 

Now, it is easy to see that J~ satisfies all newly introduced axioms of the shape <l/?,-.T as every 
6 has at most one 7? ; -successor (namely e^, if 6 e (^nRA) 1 , and none otherwise). Moreover, the 
axioms V/?,-.A are satisfied, as the e s have been chosen accordingly. 

Finally for all 6 e (^nRA) 1 the construction ensures 6 e (V(7? n ->Ri n . . . n ~'R n ). -A)- 7 as by 
construction, each /^-successor of 6 that lies within the extension of A is contained in e^, . . . , ef 
and therefore also /^-successor of 6 for some i. Now, again exploiting the fact that both of those 
concept expressions occur in negation normalized universal concept axioms D' e KB' and D" e 
KB" that are equal up to the substituted occurrence, we can derive that D' 1 c D"^ . Then, from 
D' 1 - A 1 follows D"^ = A^ making D" valid in J~. Apart from D" (and the newly introduced 
axioms considered above), all other axioms from KB" coincide with those from KB' and hence are 
satisfied in J~, as they do not depend on the /?, whose interpretations are the only ones changed in 
J~ compared to I. So we find that J is a model of KB". □ 
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6.6. From 3\£CITb to !A£,CIb. In the sequel, we show how the role functionality axioms of the 
shape <1 R.T can be eliminated from an SICCJ'Fb knowledge base while still preserving equisat- 
isf iability. Partially , the e mployed rewriting is the same as the one proposed for 3\£,CIT TBoxes 
by ICalvanese et al. (1998), however, in the presence of ABoxes more needs to be done. 



Essentially, the idea is to add axioms that enforce that for every functional role R, any two R- 
successors coincide with respect to their properties expressible in "relevant" DL role and concept 
expressions. To this end, we consider the parts of a knowledge base as defined in Section |2]on page 
[5] While it is not hard to see that the introduced axioms follow from R's functionality, the other 
direction (a Leibniz-style "identitas indiscernibilium" argument) needs a closer look. 
Taking an extended 3\£,CITb knowledge base KB, let @^(KB) denote the extended J[£jCIb 
knowledge base obtained from KB by removing every role functionality axiom <1 R.T and instead 
adding 

• VR.^D U VR.D for every D e P(KB \ {a e KB | a = <1 R.T for some R e R}), 

• V(7? n S).± U V(7? n -iS)._L for every atomic role S from KB, as well as 

• the DL-safe rule R(x, y), R(x, z) — > y ~ z. 

Clearly, this transformation can also be done in polynomial time and space w.r.t. the size of KB. 
Our goal is now to prove equisatisfiability of KB and @^(KB). The following lemma establishes 
the easier direction of this correspondence. 

Lemma 6.10. Any J[£jCI'Fb knowledge base KB entails all axioms of the J[£,CIb knowledge 
base ©r(KB), i.e. KB |= r (KB). 

Proof. Let J~ be a model of KB. We need to show that J~ also satisfies the additional rules and 
axioms introduced in 0^(KB). 

First let D be an arbitrary concept. Note that VR.-*D U W?.D is equivalent to the GCI 3R.D E VR.D. 
This is satisfied if, for any 8 e A^, if 8 has an /^-successor in , then all /^-successors of 8 are 
in . This is trivially satisfied if 8 has at most one /^-successor, which holds since J~ satisfies the 
functionality axiom ^IR.T e KB. Since we have shown the satisfaction for arbitrary concepts D, 
this holds in particular for those from P(KB \ [a e KB | a = <1 R.T for some R € Rj). 
Second, let S be an atomic role. Mark that V(7? n S).± U V(R n ~^S).± is equivalent to the GCI 
3(R n S).T E V(/? n -i5)._L. This means that for any 8 € A J , all /^-successors are also 5 -successors 
of it, whenever one of them is. Again, this is trivially satisfied as 8 has at most one /?-successor. 
Finally all newly introduced rules of the form R(x, y),R(x,z) — > y ~ z are satisfied in J~ as a 
consequence of the functionality statements in KB. □ 

The other direction for showing equisatisfiability, which amounts to finding a model of KB given 
one for @^(KB), is somewhat more intricate and requires some intermediate considerations. 

Lemma 6.11. If KB is an ^LCCJf r b knowledge base with <1 R.T € KB then in every model {J of 
©.p(KB) we find that (8, 8\) e R J and (8, 82) e imply 

• for all C e ,P(KB \ {a e KB | a = <1 R.T for some R e R}), we have 81 e C J iff 8 2 6 C J , 

• for all S e N R , we have (8, 8 l )eS J iff (8, 8 2 ) € S J . 

Proof. For the first proposition, assume 81 € . From (8,81) € R? follows 8 e (3R.C)^. Due to 
the r (KB) axiom VR.^CuVR.C (being equivalent to the GCI 3R.C E VR.C) follows 8 e (VR.C) J . 
Since (8, 82) e R^ , this implies 82 £ . The other direction follows by symmetry. 
To show the second proposition, assume (8,8\) e S J . Since also {8, 81) € R^, we have (8, 8\) e 
R n S J and hence 8 £ (3(R nS).T) J . From the r (KB) axiom V(R nS).±U V(/? n ->S).± (which 
is equivalent to the GCI 3(R n S).T E ->3(R n ->S).T) we conclude 8 e (^3(R n -^S).T) J , in 
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words: 5 has no /^-successor that is not its S -successor. Thus, as (6, 62) e R , it must also hold that 
(6, 82) € 5 3 . Again, the other direction follows by symmetry. □ 

In order to convert a model of @^(KB) into one of KB, we will have to enforce role functional- 
ity where needed by cautiously deleting individuals from the original model. Definition 16.131 will 
provide a method for this. To this end, some auxiliary notions defined beforehand will come in 
handy. 

Definition 6.12. Let ST be an interpretation, and let I be the unraveling of J"EB For a domain ele- 
ment cr € A J and an R e R, we define the set of R-neighbors of cr in I by nbj(cr) := {cr' | (cr, cr') e 
R 1 }. Among the R-neighbors, we distinguish between subordinate R-neighbors subj(cr) := {cr6 \ 
(o-,o~6) e R 1 } and the non-subordinate R-neighbors nonsubj(cr) := nbj(cr) \ subj(cr). 

Definition 6.13. Let ST be an interpretation, and let I be the unraveling of ST- Given an extended 
JULCITb knowledge base KB, let KB* := KB \ {a e KB | a = <1 R.T for some R e R}, let 
D := P(KB) and let S := {R | <1 R.T e KB). 

Then, an interpretation "7C will be called KB-pruning of I, if 7C can be constructed from I in the 
following way: Let first Ao = A J . Next, iteratively determine A;+i from A, as follows: 

• Select a word-length minimal cr from A,- where there is an S e S for which nb^(cr) > 1 and 
sub^(o-) > 0. 

• If nonsubj(cr) > 0, let A' = subj(cr), otherwise let A' = sub^(cr) \ {cr'} for an arbitrarily chosen 
cr' e sub^(cr). 

Delete A' from A,- as well as all cr** having some cr* e A' as prefix. 
Finally, let % be the limit of this process: A^ := HieN A; and ™ is the function - 1 restricted to A^. 

Roughly speaking, any KB-pruning of I is (nondeterministically) constructed by deleting surplus 
functional-role-successors. Mark that the tree-like structure of non-named individuals of the unrav- 
eling is crucial in order to make the process well-defined. 

Lemma 6.14. Let KB be an extended JKHCITb knowledge base, let J be a model o/Q^KB) and 
let I be an unraveling of ST- Then, any KB-pruning 'K of I is a model of KB. 

Proof. By construction, we know that J is a model of &<f(KB). Now, let 7C be a KB-pruning of 
I. For showing 7f |= KB, we divide KB into two sets, namely the set of role functionality axioms 
{a € KB I a = <1 R.T for some/? e R} and all the remaining axioms, denoted by KB*, and show 
7f |= KB* and 7C |= {a £ KB | a = ^IR.T for some fieR| separately. 

We start by showing |= KB*. To this end, we prove that, for each C € / 5 (KB*) and for every 
individual cr from 7f, we have cr e C K exactly if cr e C 1 . Clearly, this statement extends to 
concepts that are Boolean combinations of elements from /'(KB*), i.e., to all axioms in KB*. We 
omit this easy structural induction. 

The claim for C e P(KB*) is shown by induction over the depth of role restrictions in C, and we 
assume that is has already been shown for concepts of smaller role depth. We consider three cases: 

• C e N c U {T,_L} 

Then the coincidence follows directly from the construction of < K. 

• C = 3U.D 

"=>" cr € {3U.D) % means that there is a 7f -individual cr' with {cr,cr') e U % and cr' e D*. 



Remember that by construction, the individuals of I are sequences of individuals of ?J- For better readability, we 
will strictly use a - with possible subscripts - for /-individuals and 6 for ^-individuals. 
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Because of the construction of % by pruning J, this means also (cr, cr') £ U and by induction 
hypothesis, we have cr' e D 1 , ergo cr € (3U.D) 1 . 

"<=" If cr e (3U.D) 1 , there is an J-individual cr' with (cr,cr') £ U 1 and cr' £ D 1 . In case 
cr' is not deleted during the construction of 7f , it proves (by using the induction hypothesis on 
D) that cr £ (3U.D)' K . Otherwise, it must have been deleted due to the existence of another I- 
individual cr" for with Lemma |6Tl] ensures {R £ R | (cr, cr") e R 1 } = {R e R | (cr, cr') e P J } and 
e P(KB*) | cr" e £ J } = {E e P(KB*) | cr' e E 1 }. W.l.o.g., cr" does not get deleted in the 
whole construction procedure. Yet, then the Tf-individual cr" obviously proves cr e (BU.D)^. 
• C = VP.D 

"=>" Assume the contrary, i.e., cr e {SU.D) % but cr <£ (VU.D) 1 which means that there is an 
J-individual cr' with (cr, cr') e £/ J but cr' £ D 1 . In case cr' has not been deleted during the 
construction of 7C, it disproves cr £ (Vi/.D) (by invoking the induction hypothesis on D) leading 
to a contradiction. Otherwise, cr' is deleted because of the existence of another J-individual 
cr" for with Lemma [6jT] ensures {R e R | (cr,cr") £ R 1 } = {R e R | (cr,cr') £ P J } and 
{E £ P(KB*) | cr" e £ J } = {£ e P(KB*) | cr' £ £ J }. W.l.o.g., cr" does not get deleted in the 
whole construction procedure. Yet, then the TC-individual cr" obviously contradicts cr e (EW.D)^. 
"<=" Assume the contrary, i.e., cr e Q/U.D) 1 but cr g (VU.D)^ . The latter means that there is a 
7C-individual cr' with (cr, cr') e [/ and cr' g . Because of the construction of 7C by pruning 
J, this means also (cr, cr') e U 1 and cr' D J , ergo cr (VU.D) 1 , contradicting the assumption. 
We proceed by showing that every role R with <1/?.T € KB is functional in f K. Let cr e 
and let en, cr 2 be two /?-successors of cr. We consider two cases: First, assume that cr\ - and 
cr 2 - a£ for a\, a 2 6 N/. Then, by construction of the unraveling we can derive that there must be 
an 03 £ N/ with cr = a? . However, then, the DL-safe rule R(x,y),R(x, z) y ~ z from 0^(KB) 
ensures cr\ = o" 2 . Next we consider the case that at least one of cr\,cr2 is unnamed. By Lemma 
16.111 and the point- wise correspondence between J and 7C shown in the previous part of the proof, 
two statements hold: First, for all C £ P(KB*), we have that cr\ e C % iff cr 2 £ . Second, for all 
5 £ N# we have that (cr,cr\) £ iff (cr,cr2) e S . However, in the pruning process generating 
( K, exactly such duplicate occurrences are erased, leaving at most one P-successor per cr. Thus we 
conclude cr\ = cr 2 . This completes the proof that all axioms from KB are satisfied in f K. □ 

Finally, we are ready to establish the equisatisfiability result also for this last transformation step. 

Theorem 6.15. For any extended tftJjCI'Fb knowledge base KB, the J\£.CIb knowledge base 
O^(KB) and KB are equisatisfiable. 

Proof. Lemma 16.101 ensures that every model of KB is also a model of 0^(KB). Moreover, by 
Lemma l6. 141 given a model J~ for of 0^(KB), any KB-pruning of J~'s unraveling (the existence of 
which is ensured by constructive definition) is a model of KB. This finishes the proof. □ 

Eventually, the results of this section can be composed to show how to transform an extended 
S ( HIQb s knowledge base KB into an equisatisfiable extended J{£jCIb knowledge base by com- 
puting ©^^(KB) := ©^©^©^©^©^(KB). Moreover, as each of the single transformation steps 
is time polynomial, so is the overall procedure. Therefore, we are able to check the satisfiability of 
any extended S'HIQ knowledge base using the method presented in the previous sections, by first 
transforming it into tft£jCIb and then checking. 

This result is recorded in the below theorem, where we also exploit it to show an even stronger result 
about the correspondence between KB and @^q(KB). 

Theorem 6.16. Let KB be an extended S'HIQbs knowledge base. Then the following hold: 
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• KB and ©^-^(KB) are equisatisfiable, 
. KB \= C(a) ir®5wa(KB) \= C(a), 

• KB |= R(a,b) iff®s»a(KB) |= R(a,b), and 

• KB \= a * b iffQ S9{Q (KB) \=a*b, 
for any a.ieN/.Ce Nc, and R e N#. 

Proof. Equisatisfiability follows from the fact that each of the transformations 0<p, 0^, 0^, 0^, 0^ 
preserves satisfiability. We then use the established equisatisfiability of KB and &^q(KB) to 
prove the other claims. Assume KB |= C(a). This means that the knowledge base KB' obtained by 
extending KB with the DL-safe rule C(a) — > is unsatisfiable. Now we observe that ©^q(KB') is 
obtained by extending &s^q(KB) with C(a) — >. Since ©^q(KB') is unsatisfiable, so is &^q(KB) 
extended with C(a) — >, and hence 0^q(KB) |= C(a) as required. The other direction of the claim 
follows via a similar argumentation. The remaining cases are shown analogously. □ 

Consolidating all our results, we now can formulate our main theorem for checking satisfiability as 
well as entailment of positive and negative ground facts for extended S^HIQbs knowledge bases. 

Theorem 6.17. Let KB be an extended S1-lIQb s knowledge base and let 

P := DD(0< W o(KB)). 

Then the following hold: 

• KB is satisfiable iffP is, 

• KB |= C(a) iffP \= S c(a), 

• KB |= R(a,b) iffV \= S R (a,b), and 

• KB |= a x b iffF \= a « b, 

for any a,b e N/, C € Nc, and R £ Nr. 

Proof. Combine Theorem |6.16| wifh Theorem |5.4j □ 

Note also that the above observation immediately allows us to add reasoning support for DL-safe 
conjunctive queries, i.e. conjunctive queries that assume all variables to range only over named 
individuals. It is easy to see that, as a minor extension, one could generally allow for concept 
expressions WR.A and 3R.A in queries and rules, simply because DD(KB) represents these elements 
of / 5 (FLAT(T)) as atomic symbols in disjunctive Datalog. 



7. Related Work 



Boolea n constru c tors o n roles have been investigated in the context of both description and modal 
logics. iBorgidal (|1996l) used them extensively for the definition of a DL that is equivalent to the 
two-variable fragment of FOL. 



It was shown by iHustadt and Schmidt! ( 2000h that the DL obtained by aug menting J?L£C with full 



Boolean role constructors (JRTJB) is decidable. iLutz and Sattlen <|200lh established N ExpTime 



comp leteness of the standard rea soning tasks i n this logic. Restricting to only role negation (|Lutz and Sattler 
200 IF) or only role conjunction (|TobiesL l200lh retains ExpTiME-completeness. On the other hand, 
complexity does not increase beyond NExpTime eve n whe n allow ing for inverses, qualified num- 
ber restrictions, and nominals. This was shown by iTobies (2001) via a polynomial translation of 
Jl£,COIQS into C 2 , the two variable fragment o f first order logic with counting quantifiers, which 
in turn w as proven to be NExpTiME-com plete by IPratt-Hartmannl (I2005h . Also the description logic 
Jl£SO dSchmidt and TishkovskvL l2007h falls in that range of NExpTiME-complete DLs. 
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On the contrary, it was also shown by iTobiesJ d200lh that restricting to safe Boolean role constructors 
keeps J?L£C"s reasoning complexity in ExpTime, even when adding inverses and qualified number 
restrictions (JWCCUb). 

For logics including modeling constructs that deal with role composition like transitivity or - 
more general - complex role inclus ion axioms, result s on co mplexities in the presence of Boolean 
role constructors are more sparse. iLutz and Walther (2005) show that J\£C can be extended by 
negation and regular ex pressions on roles while keeping reasoning within ExpTime. Furthermore, 



Calvanese et al. (2007b) provided ExpTime complexity for a similar logic that includes inverses and 



qualified number restriction but reverts to safe negation on roles. The present work showed that rea- 
soning remains in ExpTime for extended SH.IQb s knowledge bases. Regarding DLs that combine 
nominals and role composition, it was shown that unsafe Boolean role constructors can be added to 
SHOIQ and SKOIQ (resulting in DLs SH.OIQ3, and S KOIQS,) without affec ting their respec- 
tive worst-case complexities of NExpTime and N2ExpTime ( Rudolph et al. . 2008 ah . The restriction 
to simple roles, on the other hand, is essential to retain decidability. Furthermore, conjunctions of 
simple roles (which are trivially safe in the absence of role negation) can be added to tractable DL s 
of the &£, and DLP families without increasing their worst-case complexity ((Rudolph et al.l l2008a). 

Type-based reasoning techniques have been described sporadically in the area of DLs but never 
been pract i cally adopted. 

Lutz et al. (2005) use a particular kind of types, called mosaics for finite model reasoning. Eite r et al 



(2009) use similar structures, called knots for query answering in the description logic S"HIQ. Both 
notions show a si milarity to the notion o f (counting) star types used for reasoning in fragments of 
first order logic (|Pratt-HartmannL 120051) . in that they do not only store information about single 
domain individuals but also about all their direct neighbors. As opposed to this, our notion of domi- 
noes e xhibits more s imilar ity to the notion of (non-counting) two-types used in first-order logic, 
e.g., by Gradel et~aD ( 1997 ): both notions encode information related to pairs of domain individuals 
(rather than whole neighborhoods). 

The approach of constructing a canonical model (resp. a sufficient representation of it) in a down- 
ward man ner (i.e., by pruning a larger structure) shows some similarity to Pratt's type elimination 
technique (IPrati 1 19791) . originally used to decide satisfiability of modal formulae. 
Canonical models the mselves have been a widely used notion in modal logic (IPopkornl . 1994; 
Blackburn et al. , 2001), however, due to the additional expressive power of 3\£CIb compared to 



standard modal logics like K (being the modal logic counterpart of the description logic J?L£C), 
we had to substantially modify the notion of a canonical model used there: in order to cope with 
number restrictions, we use infinite tree models based on unravelings whereas the canonical models 
in the mentioned approaches are normally finite and obtained via nitrations. 

Related in spirit (namely to use BDD-based reasoning for DL re asoning tasks an d to use a type 
elimination-like technique for doing so) is the work presented by IPan et al.1 (120060 . However, the 
established results as well as the approaches differ greatly from ours: the authors establish a proce- 
dure for deciding the satisfiability of J{JjC concepts in a setting not allowing for general TBoxes, 
while our approach can check satisfiability of S'H IQ (resp. Jl£,CI b) knowledge bases supporting 
general TBoxes, thereby generalizing the results by lPan et al. (2006) significantly. 

The presented method for reasoning with DL-safe r ules and assertional data exhibits simila rities 
to the algorithm underlying the KAON2 reasoner (|MotikL 120061 : iHustadt et ail 120071 . 12008b . In 
particular, pre-transformations are first applied to S'HIQ knowledge bases, before a saturation 
procedure is applied to the TBox part that results in a disjunctive Datalog program that can be 
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combined with the assertional part of the knowledge base. As in our case, extensions with DL- 
safe rules and ground conjunctive queries are possible. The processing presented here, however, 
is very different from KAON2. Besides using OBDDs, it also employs Boolean role constructors 
that admit an indi rect encoding of number restrictions. Moreover, as opposed to our approach, the 
transformation in lMotikl (120061) does not preserve all ground consequences: S'HIQ consequences 
of the form R(a, b) with R being non-simple may not be entailed by the created Datalog program. 
This shortcoming could, however, be easily corrected along the lines of our approach. On the other 
hand, the KAON2 transformation avoids the use of disjunctions in Datalog for knowledge bases 
that are Horn (i.e., free of disjunctive inf ormation). Reasoning for Hom-S'HIQ can thus be done 
in ExpTime, which is worst-case optimal ( Krotzsch et al. , 2012b . In contrast, our OBDD encoding 
requires disjunctive Datalog in all cases, leading to a NExpTime procedure even for Hom-S'HIQ. 



8. Discussion 

We have presented a new worst-case optimal reasoning algorithm for standard reasoning tasks for 
extended SfilQbs knowledge bases. The algorithm compiles SfilQbs terminologies into disjunc- 
tive Datalog programs, which are then combined with assertional information and DL-safe rules for 
satisfiability checking and (ground) query answering. To this end, OBDDs are used as a conve- 
nient intermediate data structure to process terminologies and are subsequently transformed into 
disjunctive Datalog programs that can naturally account for ABox data and DL-safe rules. The 
generation of disjunctive Datalog may require exponentially many computation steps, the cost of 
which depends on the concrete OBDD implementation at hand - finding optimal OBDD encodings 
is NP-complete but heuristic approximations are often used in practice. Querying the disjunctive 
Datalog program then is co-NP-comple te w.r.t. the size of the ABox, so that the data complexity 
of the algorithm is worst-case optimal (IMotikL l2006h . Concerning combined complexity of testing 



the satisfiability of extended knowledge bases, the ExpTime OBDD construction step dominates the 
subsequent disjunctive Datalog reasoning part, so the overall combined complexity of the algorithm 
is ExpTime resulting in worst-case optimality for this case as well, given the ExpTiME-hardness of 
satisfiability checking in S^HIQb s . 

It is also worthwhile to briefly discuss the applicability of our method to knowledge bases featuring 



so-cal led complex role inclusion axioms (RIAs). By means of techniques described by iKazakov 



d2008l) . any (pure, that is, non-extended) S"RIQb s knowledge base can be transformed into an eq 



uisatsfiable tftJXfHIQb knowledge base, however, like Motik's original transitivity elimination, 
this transformation does not preserve all ground consequences. Consequently, it is not satisfiability- 
preserving for extended S'RIQbs knowledge bases. Still, capitalizing on these RIA-removal tech- 
niques, our method provides a way for satisfiability checking for S'RIQbs knowledge bases without 
DL-safe rules that is worst-case optimal w.r.t. both combined and data complexity. We believe, 
however, that it would be not to hard a task to modify the transformation to even preserve ground 
consequences. 

For future work, the algorithm needs to be evaluated in practice. A crude prototype implementa- 
tion was used to generate the examples within this paper, and has shown to outperform tableaux 
reasoners in certain handcrafted cases, but more extensive evaluations with an optimized implemen- 
tation on real-world ontologies are needed for a conclusive statement on the practical potential of 
this new reasoning strategy. It is also evident that redundancy elimination techniques are required 
to reduce the number of generated Datalog rules, which is also an important aspect of the KAON2 
implementation. 
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Another avenue for future research is the extension of the approach to more modeling features such 
as role chain axioms and nominals - significant revisions of the model-theoretic considerations are 
needed for these cases. 
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